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TITLE 

Profiling and Identification of Television Viewers 

This application claims priority under 35 U.S.C. §119(e) for Provisional application 
numbers 60/260,946 (Docket # T734-00) filed on January 11, 2001 and 60/263,095 (Docket # 
T735-00) filed on January 19, 2001. Both of the aforementioned applications are herein 
incorporation by reference, but are not admitted to be prior art. 

This application is related to all of the AppUcants co-pending applications, patents and 
publications listed below. The entire list below is herein incorporated in their entirety but are not 
admitted to be prior art: 

• U.S. Application number 09/204,888, filed on December 3, 1998 entitled 
"Subscriber Characterization System" (Docket No. T702-00); 

• PCT Application PCT/US99/28528 (WIPO Publication WO 00/33 160A3), filed 
on December 2, 1999 entitled "Subscriber Characterization and Advertisement 
Monitoring System" (Docket No. T702-00PCT); 

• U.S. Patent Application number 09/516,983, filed on March 1, 2000 entitled 
"Subscriber Characterization with Filters" (Docket No. T702-02); 

• U.S. Patent AppHcation number 09/591,577, filed on June 9, 2000 entitled 
"Privacy-Protected Advertising System" (Docket No. T702-03); 

• U.S. Patent Application number 09/635,252, filed on August 9, 2000 entitled 
"Subscriber Characterization Based on Electronic Program Guide Data" (Docket 
NO.T702-04); ^ 

• U.S. Application number 09/205,653, filed on December 3, 1998 entitkd "Client- 
Server Based Subscriber Characterization System" (Docket No. T703-00); 

• U.S. Application number 09/205,119, filed on December 3, 1998 entitled 
"Advertisement Monitoring System" (Docket No. T704-00); 
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• U.S. Application number 09/516,314, filed on March 1, 2000 entitled 
"Advertisement Monitoring and Feedback System" (Docket No. T704-01); 

• U.S. Application number 09/452,893, filed on December 2, 1999 entitled 
"Subscriber Identification System" (Docket No. T705-01); 

5 • PCT Application PCT/US99/28600 (WIPO Publication WO 00/33233), filed on 

December 2, 1999 entitled "Subscriber Identification System" (Docket No. T705- 
OIPCT); 

• U.S. Patent Application number 09/635,253, filed on August 9, 2000 entitled 
"Subscriber Identification Based on Electronic Program Guide Data" (Docket No. 

10 T705-02); 

^ m U.S. Patent number 6,298,348, issued on October 2, 2001 entitled "Consumer 

«J Profiling System" (Docket No. T706-00); 

5 • PCT Application PCT/US99/28628 (WIPO Publication WO 00/33228), filed on 

HJ December 2, 1999 entitled "Consumer Profiling and Advertisement Selection 

15 System" (Docket No. T706-00PCT); 

□ • U.S. Patent Application number 09/553,637, filed on April 20, 2000 entitled 

"Advertising Management System for Digital Video Streams" (Docket No. T708- 
O 01); 

• PCT Application PCT/USOO/10633 (WIPO Publication WO 00/64165), filed on 
2 0 April 20, 2000 entitled "Advertising Management System for Digital Video 

Streams" (Docket No. T708-01PCT); 

• U.S. Patent Application number 09/553,099, filed on ^pril 20, 2000 entitled 
"Advertisement Insertion Techniques for Digital Video Streams" (Docket No. 
T708-02); 

25 •U.S. Patent Application number 09/568,084, filed on May 10, 2000 entitled 

"Advertisement Subgroups for Digital Video Streams" (Atty. Docket No. T7 1 1 - 
01); 
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• PCX Application PCT/USOO/12710 (WO Publication WO 00/69163), filed on 
May 10, 2000 entitled "Advertisement Subgroups for Digital Video Streams" 
(Docket No. T711-01PCT); 

• U.S. Patent Application number 09/568,477, filed on May 10, 2000 entitled 

5 "Advertisement Subgroups for Digital Video Streams" (Docket No. T7 1 1-02); 

• U.S. Patent Application number 09/635,539, filed on August 10, 2000 entitled 
"Delivering targeted advertisements in cable-based networks" (Docket No. T7 11- 
03); 

• U.S. Patent Application number 09/694,848, filed on October 20, 2000 entitled 
L 0 "Method and Apparatus for Inserting Digital Media Advertisements into 

O Statistical Multiplexed Streams" (Docket No. T712-10); 

f • PCT Application PCT/USOO/29034 (WIPO Publication WO 01/30086A), filed on 

i5i October 20, 2000 entitled "Method and Apparatus for Inserting Digital Media 

r\ Advertisements into Statistical Multiplexed Streams" (Docket No. T7 1 2- 1 OPCT); 

iJI • U.S. Patent Application number 09/796,339, filed on February 28, 2001 entitled 

jf "Privacy-Protected Targeting System" (Docket No. T7 15-10); 

]j • PCT Application PCT/US01/06650(WIPO Publication WO 01/65453A1), filed 

O on February 28, 2001 entitled "Privacy-Protected Targeting System" GDocket No. 

T715-10PCT); 

20 • U.S. Patent Application number 09/635,542, filed on August 10, 2000 entitled 

"Grouping Subscribers Based on Demographic Data" (Docket No. T719-00); 

• U.S. Patent Application number 09/635,544 filed on ^gast 10, 2000 entitled 
"Transporting Ad Characterization Vectors" (Docket No. T720-00); ^ ^ - . 

• U.S. Patent Application number 09/7 12,790 filed on November 14, 2000 entitled 
2 5 "Queue Based Advertisement ScheduUng and Sales" (Docket No. T721-10); 

• PCT Apphcation PCT/USOl/27217, filed on August 3 1, 2001 entitled "Targeted 
Advertising at the Set-Top Box" (Docket No. T721-10PCT); 
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• U.S. Patent Application number 09/731,606 filed on December 7, 2000 entitled 
"System for Providing Targeted Advertisements Using Advertiser Specific Target 
Groups " (Docket No. T721-12); 

• U.S . Patent Application number 09/748 ,949 filed on December 27, 2000 entitled 
"Advertisement Filtering and Storage for Targeted Advertisement Systems" 
(Docket No. T721-13); 

• U.S. Patent Application number 09/742,527 filed on December 21, 2000 entitled 
"System and Method for Automatically Managing Avail Inventory Data and 
Avail Pricing" (Docket No. T721-14); 

• U.S. Patent Application number 09/748,942 filed on December 27, 2000 entitled 
"Advertisement Distribution System for Distributing Targeted Advertisements in 
Television Systems" (Docket No. T721-15); 

• U.S. Patent Application number 09/742,506 filed on December 21, 2000 entitled 
"Internet Based Electronic Program Guide Advertisement Insertion Method and 
Apparatus" (Docket No. T721-16); 

• U.S . Patent Application number 09/748,943 filed on December 27, 2000 entitled 
"Delivering Targeted Advertisements to the Set-Top Box" (Docket No. T721-17); 

• U.S. Patent Application number 09/742,534 filed on December 21, 2000 entitled 
"Queue Based Head-End Advertisement Scheduling Method and Apparatus" 
(Docket No. T721-18); 

• U.S. Patent Application number 09/742,852 filed on December 21 , 2000 entitled 
"System for Rescheduling and Inserting Advertisements" (Docket No. T721-19); 

• U.S . Patent Apphcation number 09/750,800 filed on December 28, 2000 entitled 
"System and Method for Delivering Targeted Advertisements Using Multiple 
Presentation Streams" (Docket No. T721-20); 

• U.S. Patent Application number 09/766,004 filed on January 19, 2001 entitled 
"System and Method for Delivering Statistically Scheduled Advertisements" 
(Docket No. T721-21); 
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• U.S. Patent Application number 09/824,434 filed on April 2, 2001 entitled 
"Grouping of Advertisements on an Advertising Channel in a Targeted 
Advertising System" (Docket No. T721-22); 

• U.S. Patent Application number 09/658,204 filed on September 8, 2000 entitled 
5 "Targeted Advertising Through the Electronic Program Guide" (Docket No. 

T723-00); 

• U.S. Provisional Application number 60/238,059, filed on October 5, 2000 
entitled "Platform Independent Addressable Television Advertising System" 
(Atty. Docket No. T726-00); 

ID • U.S . Provisional Application number 60/23 8,056, filed on October 5 , 2000 

entitled "Method and System for Addressable Advertising in the Electronic 
g Program Guide" (Atty. Docket No. T727-00); 

;0 • U.S. Patent Application number 09/749,255 filed on December 27, 2000 entitled 

yQ "Scheduling and Linking IPG Ads in Conjunction with Programming Ads in a 

Television Environment" (Docket No. T727-10); 

Jj • PCT Application PCT/USOl/31682, filed on October 5, 2001 entitled 'Targeting 

Ads on IPG' s Live Programming and Recorded Programming, and Coordinating 
the Ads therebetween" (Docket No. T727-10PCT) 

• U.S. Patent AppUcation number 09/680,622 filed on October 6, 2000 entitled 
2 0 "Method and System for Addressable and Program Independent Advertising 

During Recorded Programs" (Docket No. T728-10); 

• U.S. Patent Application number 09/751,349 filed on December 28, 2000 entitled 
"Inserting Local Signals During MPEG Channel Changes" (Docket No. T730- 
10); ^ " 

25 • U.S. Provisional Application number 60/267,370, filed on February 8, 2001 

entitled "Presentation Stream Switching Using Channel Maps" (Docket No. 
T736-00); 



Patent Application 



-5- 



T734-10 



• U.S. Provisional Application number 60/278,612, filed on March 26, 2001 
entitled "Formation and Utilization of Cable Microzones" (Docket No. T737-00); 

• U.S. Provisional Application number 60/281,037, filed on April 3, 2001 entitled 
"Personal Video Recorder (PVR) Market Overview and Advertising 
Opportunities" (Docket No. T738-00); 

• U.S. Provisional Application number 60/329,992, filed on October 17, 2001 
entitled "Personal Video Recorder (PVR) System Requirements and 
Specification" (Docket No. T738-01); 

• U.S. Patent Application number 09/928,024 on August 10, 2001 entitled 
'Targeting Ads to Subscribers based on Privacy Protected Subscriber Profiles" 
(Docket No. T741-10); and 

• PCX Application PCT/USOl/25261, filed on August 10, 2001 entitied "Targeting 
Ads to Subscribers based on Privacy Protected Subscriber Profiles" (Docket No. 
T741-10PCT). 



Background of the Invention 

Advertising forms an important part of broadcast programming including broadcast video 
(television), radio and printed media. The revenues generated from advertisers subsidize and in 
some cases pay entirely for programming received by subscribers. For example, over the air 
broadcast programming (non-cable television) is provided entirely free to the subscribers and is 
essentially paid for by the advertisements placed in the shows that are watched. Even in cable 
television systems and satellite-based systems, the revenues from advertisements subsidize the 
cost of the programming, and were it not for advertisements, the monthly subscription rates for 
cable television would be many times higher than at present. Radio similarly offers free ^ 
programming based on payments for advertising. The low cost of newspapers and magazines is 
based on the subsidization of the cost of reporting, printing and distribution from the advertising 
revenues. 

Along with the multitude of programming choices that the television viewer faces, the 
viewers are subject to advertisements. While advertisements are sometimes beneficial to 
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subscribers and deliver desired information regarding specific products or services, consumers 
generally view advertising as a "necessary evil" for broadcast-type entertainment. A prior art 
(present model) of providing advertisements along with actual progranoming is based on linked 
sponsorship. In the linked sponsorship model, the advertisements are inserted into the actual 
programming based on the contents of the programming, e.g., a baby stroller advertisement may 
be inserted into a parenting program. Even with linked sponsorship, advertising, and in 
particular broadcast television advertising, is mostly ineffective. That is, a large percentage, if 
not the majority, of advertisements do not have a high probability of affecting a sale. In addition 
to this fact, many advertisements are not even seen/heard by the subscriber who may mute the 
sound, change channels, or simply leave the room during a commercial break. The reasons for 
such ineffectiveness are due to the fact that the displayed advertisements are not targeted to the 
subscribers' needs, likes or preferences. Generally, the same advertisements are displayed to all 
the subscribers irrespective of the needs and preferences of the subscribers. 

In order to deliver more targeted programming and advertising to subscribers, it is 
necessary to understand their likes and disUkes to a greater extent than is presently done today. 
Targeting of an ad requires knowing certain attributes of the target viewer, demographic, 
psychograph, and any data relevant to determining the relative appropriateness of an ad for the 
particular viewer. Systems which identify subscriber preferences based on their purchases and 
responses to questionnaires allow for the targeted marketing of literature in the mail, but do not 
in any sense allow for the rapid and precise delivery of progranaming and advertising which is 
known to have a high probability of acceptance to the subscriber. In order to determine which 
programming or advertising is appropriate for the subscriber, knowledge of that subscriber and 
of the subscriber's programming preferences is required. Characterizing or profiling viewers 
based on viewing habits may be used to achieve targeted advertising. 

Methods for monitoring the viewing habits of television viewers, for classifying TV 

. -sea 

programming into categories, and for using the viewing habits for determining viewing 
preferences have been previously disclosed. For example 

• Bedard (WIPO publication WO 98/21 877A2 ), assigned to Hyundai Electronics of San 
Jose, California, discloses a method for monitoring a viewer's viewing habits in order 
to arrange an electronic program guide (EPG) and determine viewing preferences; 
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. Barton et al. (WIPO publication WO 00/59223), assigned to TIVO, Inc. of Alviso, 
California, discloses a data storage and scheduling system in a personal video 
recorder (PVR). Based on past viewing habits/preferences, the PVR automatically 
records desired programs; and 

• Maissel et al. (WIPO publication WO 99/01984A1), assigned to NDS Limited of 
Middlesex, England, discloses a method of customizing the EPG by monitoring 
viewing behavior to create a preference profile. 

A method for delivering targeted ads to different demographic groups in a television 
environment was disclosed by Wachob (U.S. Patent No. 5,155,591), assigned to General 
Instrument of Hatboro, Pennsylvania. Wachob discloses a cable television system for 
broadcasting different commercial messages to different demographically targeted audiences. 
Demographic information is obtained and targeted audiences are formed based on subscriber 
address (i.e., geographic location) or on household survey information such as a viewing habit 
diary kept by the subscriber. 

Methods for delivering advertising or other customized programming to viewers in a 
television environment based on previous viewing habits or menu selections has previously been 
disclosed. For example, Despain et al. (WIPO publication WO 00/14951), assigned to Next 
Century Media, Inc. of New Paltz, New York, discloses a system for targeted advertising in a 
digital system. In the system, a digital set-top box (STB) captures and uploads household data 
viewing preferences to the cable operator head end, which can then be used to deliver targeted 
ads and other content to the viewer based on the viewer preferences. An on-screen questionnaire 
is used to eUcit demographic attributes and preferences from viewers. "Boo" and "applause" 
buttons on the remote control are used by the viewer to indicate viewer likes/dislikes. Channel 
change data can be captured and sent upstream, and in conjunction with Bata from the 
questionnaires and interactive buttons, may be used to provide each viewer with their^wr^ 
custom menu and forms of customized programming. However, Despain et al. do not teach 
local storing and processing of data to generate a viewer profile to be stored and utilized at the 
STB. Moreover, it does not teach specific methods on how to create a demographic, 
psychographic or other viewer profile from a multitude of viewer interaction data or how to 
correlate or use those profiles for the delivery of targeted advertisements. 
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A preference agent for monitoring television programs watched by a viewer is disclosed 
in Gogoi et al. (WIPO publication WO 99/65237), assigned to Metabyte, Inc. of Freemont, 
California. The preference agent, located within a STB or PVR, also retrieves category 
information about viewed programs, and generates a viewer program preference profile. The 
5 preference agent automatically records or suggests programs of interest to the viewer based on 
the viewer's program preference profile. 

The creation of user profiles on an interactive computer is disclosed in Freeman et al. 
(U.S. Patent No. 5,861,881), assigned to ACTV Inc., of Freeman, New York. The profiles are 
based on the selections made during one of the interactive programs, however, are limited to 
1 0 interactive program activity by the viewer, and are not based on general viewing habits or 
general surf activity. 

i|3 Hendricks et al. (U.S. Patent No. 6,160,989), assigned to Discovery Cominunications 

j3 Inc., of Bethesda, Maryland, discloses a network controller that provides monitoring and control 
■■'f of STBs. The network controller also gathers data received from the STBs to compile subscriber 
W viewing information and programs watched information. The data is processed to generate 
'^'^ packages of advertisements, as well as account and billing reports, targeted towards each STB. 
iss build a personal profile, the viewer answers a series of questions presented on a series of 
y questionnaire menu screens. 

0 Barton et al. (WIPO publication WO 00/59223 ), assigned to TIVO Inc. of Alviso, 

20 California, disclose a data storage and scheduling system, wherein viewing preferences can be 
inferred from viewing patterns, and where the navigation actions of the TV channels by the 
viewer are recorded, stored, and then sent upstream. However, Barton et al. do not teach or 
suggest local profiling or viewer identification, or how profiles could be utiUzed to delivery 
targeted advertising. i 

2 5 For the foregoing reasons, a need exists for a method and system for monitoring click- 

stream and other interactivity of a viewer with the viewer's television viewing envirormient and 
generating one more viewer profiles therefrom. Additionally, a need exists for the monitoring of 
interactivity and generation of viewer profiles to be performed within the television viewing 
environment (i.e., TV, STB, PVR). Furthermore, a need exists for such profiling to be done in a 

3 0 secure and privacy-protected manner. Moreover, a need exists for a reliable way of 



Patent Application 



-9- 



T734-10 



automatically, detecting or inferring, which specific individual or individuals, are actually 
watching the TV in a household comprising more than one individuals at a particular time, and 
for generating one or more profiles per each individual. 

5 Summary of the Invention 

The invention comprises a method of characterizing or profiling one or more viewer's, by 
monitoring and processing at the viewer set-top or receiver, each viewer's interactivity (e.g., via 
a remote control unit) with the set-top receiver, and then generating one or more profiles for each 
viewer based on one or more of the multitude of interactions of each viewer and on, in general, 
1 0 the viewing habits and preferences of the viewer. The invention further comprises a method of 
automatically and reliably, detecting or inferring at a particular time, which specific individual or 
0 individuals, are actually watching (i.e., interacting with the remote) the TV in household 
yji comprising more than one individual. Such viewer identification and profile generation can be 
■ used to facilitate the delivery of targeted content including targeted advertising. 

1;| The invention also comprises a system for carrying out the local (i.e., at the set-top) 

s profiling of viewers based on their viewing preferences and other interactions at the set-top and 
f =1 for distinguishing between viewers in a multi-viewer household. The invention further 

comprises a profiler or profiling agent, resident at the viewer's receiver or set-top, and whereby 
P the click-stream and other interactivity by a viewer can be monitored and processed to generate 
2 0 one more viewer profiles and for distinguishing between viewers in a multi- viewer household. 

In a preferred embodiment, the profiling agent is resident at a viewer's receiver and is 
responsible for profiling a single household, and distinct individual members of that household. 
The set-top box profiler monitors the viewing behavior and other receiver interactivity of the 
viewers including their viewing preferences and utilizing data about programming viewed or not 

2 5 viewed, derives characteristics about and generates profiles of the household and of individual 

viewers within the household. In a preferred embodiment, the profiles are comprised of profile 
categories, including the categories of preferred programs, preferred networks, viewing duration, 
channel change frequency, and holding factor per program or program category. The program 
data may be delivered to the set-top box periodically (e.g., on a carousel) or may be delivered 

3 0 with the video (e.g., in the VBI or in the PSI). 
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In a preferred embodiment, the profiler identifies different viewing sessions, a period of 
time during which viewership is static and does not change (i.e., a session of one or more 
specific viewers), based on viewing and other habits and preferences exhibited by the viewer or 
viewers, and creates session profiles for each session. Signature profiles are created from one or 
5 more correlated session profiles, and subsequent session profiles are correlated and matched with 
existing signature profiles. Sessions (or session durations) may be defined by, for example, the 
cycling of power (on/off) of the set-top box, a specific window of time or day-part, periods of 
inactivity, and monitoring channel change, viewing and other viewer activity. 

In one embodiment, the profiler comprises an event queue that stores viewer interactivity 
10 as viewer-generated events. The profile engine accepts events from the event queue, reads 
i"i database information (e.g., program data), and processes the events to produce the subscriber 
yi profiles. Events are dispatched to the profiling engine based on the clock time, and each of these 
5^ events may be used to update and modify the viewer's profile. In a preferred embodiment the 
a3 profile engine uses filters to process events. Eeich filter may handle one or more profiles 
IjS elements, determining whether or not each specific event is applicable to (i.e., should be used to 
update) one or more profiles. 

•"1 In one embodiment, automatic session detection uses channel change and other 

H information to dynamically determine which viewer is watching television at any given time. In 
1,1 a preferred embodiment, reliable viewer identification is accomplished and more complete, 
2 0 updated and accurate profiles of particular viewers are generated, by combining current session 
data with historical data for those viewers. This historical data, or signature, is an aggregation of 
session data for a particular viewer or type of viewer, and reflects a set of viewing and 
interactivity habits. During a specific session, as the profiler collects viewer interactivity session 
data, the profiler continuously correlates the session data with existing signature data in order to 
25 match the session data to a specific signature. This correlation associates long term viewing , 
habits with a particular user based on their short term viewing habits. Moreover, matching 
session data with a signature identifies a particular viewer and an associated profile. 

In one embodiment, the signature profile represents an individual viewer. In an alternate 
embodiment, the signature profile represents a plurality of household viewers. In another 
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embodiment, the signature profile is a standard profile, created independently of any session data 
or created from aggregated session data from a plurality of household. 

Default or standard signature profiles, comprising standard profile categories may be 
generated, independent of local viewing sessions, and may be downloaded to the local receiver 
5 or set top. Local viewing session profiles are correlated with these standard signature profiles to 
identify one or more particular attributes of the session viewer. 

Advantages of the invention include the ability to, based on viewing habits and other 
interactions, identify individual viewers or groups of viewers that possess or share certain 
preferences, demographic, or other relevant traits, and the ability to identify one or more 
1 0 particular viewers, from among a larger group of viewers, and to reliably identify whether that 
S viewer is a man, a woman, or a child. The identification of specific viewers, based on each 
: j viewers current and previous interactivity with a receiver, and the characterization (profiling) of 
53 each of those viewers (or groups of viewers) creates enormous opportunities for content 
i.y providers and advertisers to address their content to an individual or groups of individuals with 
li§ particular demographic traits or who have particular interests. An additional advantage of the 
|=,5, invention is that the local monitoring and profile generation at the set-top is consistent with and 
f;1 affords solutions to security and privacy considerations. 

These and other features and objects of the invention will be more fully understood from 
the following detailed description of the preferred embodiments that should be read in light of 
2 0 the accompanying drawings. 



Brief Description of the Drawings 

The accompanying drawings, which are incorporated in and foraf a part of the 
specification, illustrate the embodiments of the present invention and, together with the ^ 
2 5 description serve to explain the principles of the invention. 

In the drawings: 

FIG. 1 illustrates exemplary television delivery networks for which the current invention 
could be implemented; 
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FIG. 1 illustrates exemplary television delivery systems for v^'hich the current system 
could be implemented; 

FIG. 2A illustrates an exemplary system diagram and architecture of the current 
invention, according to one embodiment; 

5 FIG. 2B illustrates an exemplary context diagram of a Viewer Characterization and 

Profiling System (VCPS), according to one embodiment; 

FIGs. 3A-B illustrate an exemplary head-end channel map table format and channel map 
table respectfully, according to one embodiment; 

FIGs. 4A-B illustrate an exemplary head-end network table format and network table 
liO respectfully, according to one embodiment; 

FIGs. 5A-B illustrate an exemplary head-end BpID table format and BpID table 
03 respectfully, according to one embodiment; 

• J FIGs. 6A-B illustrate an exemplary head-end program table format and program table 

respectfully, according to one embodiment; 

1*S FIGs. 7A-B illustrate an exemplary set-top box (STB) channel map header format and 

channel map table format respectfully, according to one embodiment; 

f f FIG. 8 illustrates an exemplary program schedule; 

FIGs. 9A-C illustrate exemplary STB fixed length program header format and fixed 
length program record format respectively, according to one embodiment; 

2 0 FIGs. lOA-C illustrate exemplary STB variable length program header format, network 

record format, and program record format respectively, according to one embodiment; 

FIG. 1 1 illustrates an exemplary logic flow of how user interactions are captured, 
according to one embodiment; ^ ^' 

FIG. 12 illustrates an exemplary class diagram of how the click stream of a viewer is 
2 5 processed to generate a viewer profile, according to one embodiment; 

FIG. 13 illustrates an exemplary subset of genres and categories as defined by the VCPS, 
according to one embodiment; 
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FIG. 14 illustrates an exemplary conversion of program data, according to one 
embodiment; 

FIG. 15 illustrates an exemplary process flow of genre data, according to one 
embodiment; 

FIG. 16 illustrates an exemplary preferred program category profile, according to one 
embodiment; 

FIG. 17 illustrates an exemplary graphical representation of the genre profile of a viewer, 
according to one embodiment; 

FIGs. 18A-C illustrate exemplary tables for probabilities based on program categories, 
adjustments to those probabilities based on day part, and normalization of the adjusted 
probabilities respectively, according to one embodiment; 

FIG. 19 illustrates an exemplary set of conditional probabilities for different genres, 
according to one embodiment; 

FIG. 20 illustrates an exemplary preferred networks profile, according to one 
embodiment; 

FIGs. 21A-B illustrate examples of records and graphs of network viewership by day part 
profile table and graph respectively, according to one embodiment; 

FIGs. 22A-B illustrate exemplary average viewing duration profile table and graph 
respectively, according to one embodiment; 

FIG. 23 illustrates an exemplary channel change frequency by day part profile, according 
to one embodiment; 

FIG. 24 illustrates an exemplary holding factor profile, according to one embodiment; 

FIGs. 25A-D illustrate exemplary channel order, surf session and dwell time ptofil&s, and 
a dwell time graph respectively, according to one embodiment; 

FIG. 26 illustrates an exemplary process flow chart illustrating how sessions are 
automatically initiated and terminated and new sessions detected, according to one embodiment; 
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FIGs. 27A-B illustrate examples of how the time span of a viewing session can be 
determined and delimited, according to one embodiment; 

FIGs, 28A-C illustrate examples of how viewing sessions can be defined based on viewer 
activity or inactivity, according to one embodiment; 

5 FIG. 29 illustrates a sliding window is used to identify session termination and initiation 

points, according to one embodiment; 

FIG. 30 illustrates sliding windows and their associated data sub-sessions, according to 
one embodiment; 

FIGs. 31A-B illustrate how session data is compared with the signature data already 
l|f stored on the VCPS, according to one embodiment; 

FIGs. 32A-B illustrate an exemplary process flow chart of how events are processed and 

(33 associated with a particular session, and how the sliding window is managed, according to one 

Q embodiment; 

g FIG. 33 illustrates how session data are correlated and matched to signature data, 

£| according to one embodiment; 

i'T FIG. 34 illustrates an exemplary process flow of how session data is saved and matched 

C= to a signature, according to one embodiment; and 

FIGs. 35A-B illustrate an exemplary process flow generating the session profile and 
adding the session profile to a correlated signature profile based on the event queue, according to 
2 0 one embodiment. 



Detailed Description 

of the Preferred Embodiment ^ 

In describing a preferred embodiment of the invention illustrated in the drawings, specific 
2 5 terminology will be used for the sake of clarity. However, the invention is not intended to be 
limited to the specific terms so selected, and it is to be understood that each specific term 
includes all technical equivalents which operate in a similar manner to accomplish a similar 
purpose. 
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With reference to the drawings, in general, and FIGS. 1 through 35 in particular, the 
method and apparatus of the present invention is disclosed. 

The present invention can be implemented for use with various television (TV) delivery 
systems including, but not hmited to, digital broadcast satellite (DBS) systems, switched digital 
5 video (SDV) systems, local multipoint distribution systems (LMDS), multichannel multipoint 
distribution systems (MMDS), hybrid fiber coax (HFC) systems, the Internet, other cable TV 
(CTV) systems, or other terrestrial wireless networks. The TV delivery system can deliver 
programming in various forms, including but not hmited to digital video, analog video, or 
streaming media. The programming may be compressed in accordance with a variety of now 
1 0 known or later discovered compression standards, such as the current Motion Picture Expert 
Group (MPEG-2) standard for digital video. 

sj FIG. 1 illustrates exenaplary embodiments of three of the most common types of TV 

■Q delivery networks discussed above with respect to Rg. 1 with which the present invention can be 
•f^ suitably used. These include DBS 100, CTV 120, and very high-speed digital subscriber line 
1^ (VDSL) 150. A DBS system 100 transmits a programming stream comprising upwards of a 
J~ hundred channels of TV programming directly from a geo-stationary satellite transmitter 102 
5=4 orbiting the earth to a receiving antenna 104 mounted on or near each subscriber's house 106. 
yj The programming stream is transmitted from the antenna via a cable (not shown) to a satellite 

receiving station 108 in the form of a set-top box (STB) in the subscriber's house 106. The 
B© satellite receiving station (i.e., STB) 108 selects a channel and demodulates the signal for 

delivery to a monitor 1 10, such as a television. Most DBS systems 100 are arranged such that 
data also can be sent in the upstream direction, that is, from the STB 108 to the DBS provider. 
In most DBS systems 100, the STB 108 also is coupled to the telephone line and is designed and 
programmed to place telephone calls to the DBS service provider to peripdically send 

2 5 information in the upstream direction. Such information commonly may comprise requests for 

Pay-Per-View (PPV) programs, requests for changes in the subscription (a request that^one or 
more of premium channels be added to the service, etc.). 

A CTV network 120, such as digital cable network, transmits multiple channels of TV 
information from a head end or central office (HE/CO) 190 via a cable network 122. 

3 0 Particularly, the channels are transmitted via cables 124, such as fiber optic cables, to nodes 126. 

The nodes are essentially switching/routing stations that service multiple homes (usually a few 

Patent Application -16- T734-10 



hundred). The nodes 126 route the signals to individual subscribers 128. For digital cable, the 
individual subscribers 128 will have STBs 130 that select a particular channel from the transmit 
stream, demodulate it and forward it for display on one or more monitors (i.e., televisions) 132. 
Upstream information may be sent from the STB 130 to the HE/CO 190 via a dedicated 
5 upstream channel over the cable. In cable systems that do not support two-way communication, 
the upstream "channel" can be through the telephone as described above in connection with DBS 
systems 100. 

A VDSL system 150 transmits TV programming over the regular telephone network. 

Particularly, TV signals are transmitted from a broadband distribution terminal (BDT) 152 
1 0 within the HE/CO 190 via cables 154, such as fiber optic cables, to a universal service access 

multiplexer (US AM) 156 that delivers the data to multiple individual subscriber households 160 
5 via regular telephone twisted wire pair 158 using VDSL modems and protocols. The USAM 156 

receives a wide bandwidth signal comprising some or all of the television channels. However, 
03 because of the bandwidth limitations of twisted pair wire, typically only about one channel of 
ij| television progranuning at a time can be deUvered from the USAM 156 to the household. 
O Accordingly, the subscriber has a STB 162 that is similar in functionality to the previously 
La discussed STBs for DBS and CTV, except that when the user changes channels such as by 
^'^ operating a remote control, the remote channel change signal is received by the STB 162 and 
f== transmitted to the USAM 156 which switches the channel for the user and begins sending the 
||) newly selected channel to the household. Such systems are known as SDV systems. SDV 

systems are essentially fully modem asynchronous two-way communication networks. 

Accordingly, the STB 162 can transmit information upstream via the same VDSL modem that 

receives the downstream signals. SDV systems typically operate using an asynchronous transfer 

mode (ATM) protocol that is well-known in the networking arts. 

25 In an alternative embodiment, the TV signals are transmitted from the BDT 152 to & . . 

broadband network unit (BNU) 164. The BNU 164 deUvers the data to individual households 
160 using coaxial cable 166. 

It should be noted that STBs are described above with reference to Fig. 1. The current 
invention may be incorporated in an STB but may also be incorporated in other devices that can 
3 0 perform the same, similar, or additional functions. These devices may include, but are not 
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limited to, TV, Video Cassette Recorder (VCR), Digital Video Recorder (DVR), Personal Video 
Recorder (PVR), Residential Gateway (RG), and computers. For simplicity the term STB will 
be used herewithin to cover all the various devices. 

FIG. 2A illustrates an exemplary system diagram and architecture of the current 
5 invention, according to one embodiment of the invention. As illustrated, the architecture 
includes a HE 210 and a STB 220. The HE 210 delivers program content to the STB 220 and 
may receive commands, such as channel change commands, or viewer profiles 199 from the STB 
220. The HE 210 includes a conomunications manager 212, a program database 214 and a 
profile database 216. In one embodiment, the HE 210 also includes a compressor 218. The 
1 0 communications manager 212 handles the communications with the STB 220, including 
O packaging and delivering program content to the STB 220 and receiving data, such as commands 
^ or profiles, from the STB 220. The program database 214 stores information about the programs 

>iSsl 

to be broadcast. The profile database 216 stores profiles for multiple households connected 
;|i thereto. The profile database may group or cluster profiles according to similarity. The 
i:i compressor 218 is used to compress program information prior to being transmitted to the STBs 
r 220. 

O The STB 220 receives program content from the HE 210 and delivers the program 

ill content to the TV, receives and processes commands from a viewer, and monitors the 
w interactions of the viewer to generate viewer profile(s). The STB 220 includes a 
2 0 communications manager 222, a user interface 224, a profile engine 230, profile filters 240, a 
profile database 250, a program database 260, a clock 270, and an event queue 280. 

The communications manager 222 handles the communications with the HE 210. The 
communications manager 222 may receive program content or database downloads (i.e., 
program data, channel maps) from the HE 210 and may transmit conunafids (i.e., channel 
2 5 changes), profiles (i.e., updated), or other information including anonymous system stetistks 

(i.e., audience measurements). The user interface 224 allows the viewer to interact with the STB 
220, for example, via a conventional remote control unit. The viewer interactions include, but 
are not limited to, channel and volume changes, EPG activity, and participation in interactive 
entertainment and advertisements. 



Patent Application 



-18- 



T734-10 



The program database 260 stores program data including, but not limited to, name, 
network, time, and genre of programs being transmitted or to be transmitted from the HE 210. 
The event queue 280 stores both viewer-generated events and internal events. Viewer-generated 
events include viewer interactions as mentioned above. Internal events include, but are not 
5 limited to, the end of a program or the end of a day part. The clock 270 runs independently 

within the system. The clock 270 is used to mark the time that the viewer generated events occur 
and to trigger internal events, such as, program changes, end of day parts, and change of day. 
The events are dispatched from the event queue 280 to the profiling engine 230 based on the 
clock 270. 

1 0 The profile engine 230 accepts events from the event queue 280 and, if applicable, 

retrieves program data related to a selected program from the program database 260. The profile 

! engine 230 in conjunction with the profile filters 240, processes the events to produce the viewer 
-J profiles 199 which are then stored in the profile database 250. Each of the profile filters 240 

3 handles a single profile element (as will be described further herein). 

15 It should be noted that typical STBs likely include some type of user interface and 

network interface (i.e., communications manager). The communications manager 222 and the 
= user interface 224 may be typical components of a STB or slight modifications thereof. Also, the 
j.^ clock 270 may be standard component of a STB. The profile database 250, the program database 
j' f 260 and the event queue 280 may be a single memory device or may be separate memory 
2 0 devices. The profile engine 230 and the profile filters are resident software applications stored 
and running on the STB 220. As one of ordinary skill in the art would know, there are numerous 
variations of this architecture that are well within the scope of the current invention. 

The term program data in the context of the present invention is meant to include and 
encompass one or more subsets of information, which identifies, describes and generally 
2 5 characterizes specific television programs and television networks, categories of programs^and 
networks, etc. Program data can be readily obtained from several commercial enterprises 
including TVdata of Glen Falls, NY. In a preferred embodiment, program data is used in order 
to be able to characterize what program and networks are viewed, surfed, or the Hke. For 
instance, program data is used to convert channel, date, and time into a program name and 
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content type. This information determines the types of programs that are preferred by a 
household and how long certain programs hold their attention. 

In a preferred embodiment, the HE 210, or other upstream system, generates and stores 
the program database 214 and transmits relevant program data to the STB 220. According to one 

5 embodiment, the relevant program data are placed on a download carousel, where they are 
transmitted in their entirety to the STB 220. Alternatively, the relevant program data can be 
downloaded periodically in smaller increments. In another embodiment, the program data is 
transmitted with the programs. For example, the program data is transmitted within the vertical 
blanking interval (VBI) or the program specific information (PSI) of a transport stream, such as, 
10 a Motion Picture Expert Group (MPEG-2) transport stream. It should be understood however, 

s= =1 that many alternate methods for getting the program data exist and the utility and scope of the 

■-[f present invention is not limited by how the data is obtained. 

In one embodiment, the HE 210 aggregates the program data by cable supplier, date, 
time, and television network to produce a variety of tables. These tables or a variation thereof 
1;| are downloaded to the STB 220. Furthermore, the STB 220 may receive program data in one 

format and subsequently process the downloaded program data to produce other tables and 
23 alternate formats. As will be evident to those skilled in the art, the format of the program data at 
i*r the HE 210, the format by which it is downloaded to the STB 220, and the format of the program 
J'J data stored at the STB 220 can be widely variable. The invention is in no way limited to the 

2 0 specific formats discussed and illustrated below. 

FIG. 2B depicts a context diagram of a Viewer Characterization and Profiling System 
(VCPS) 290 according to one embodiment of the invention. The VCPS 290 monitors 
interactivity 291 of one or more viewers 292 with the TV. The interactivity includes, but is not 
limited to, channel changes, volume changes, EPG activity, and pow^ ofi/off events. According 
25 to one embodiment, the VCPS 290 generates one or more viewer profiles 293 (sessionj)rofiles 
294 and signature profiles 295) based on the interactivity. The viewer profiles 293 identify 
characteristics about one or more categories of viewing habits of the user, including but not 
limited to, channel change rate, dwell time, amount of TV watched, volume habits, and EPG 
habits. According to another embodiment, the VCPS 290 uses program data 296, such as 

3 0 program and network information 297 (i.e., stored in program database 260), to further define the 
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viewer profiles 293 to include viewing preferences related to specific networks, program types, 
and programs. According to another embodiment, the VCPS 290 utilizes heuristic rules 298 to 
further define the viewer profiles to infer preferences and demographic traits of the viewer. 

The session profiles 294 identify characteristics about the user (or group of users) for that 
5 particular viewing session. The signature profiles 295 are a compilation of closely related 
session profiles 294 (are associated with the same user or group of users). The VCPS 290 can 
identify which user (or group of users) are interacting with the TV by correlating the current 
session profile 294 with one or more signature profiles 295. The session profile 294 is identified 
with the signature profile 295 having the highest correlation, as long as it meets a predefined 
10 correlation threshold. The identification is an identification of preferences and not necessarily an 

actual identification of the user (or users). The viewer profiles 293 can be used for a variety of 
& purposes including, but not limited to, delivering targeted content mcluding advertising and for 
al distinguishing a particular viewer from a household of multiple viewers. 

yy FIGs. 3A and 3B illustrate an exemplary channel map table format and channel map table 

that may be stored in the program database 214. As illustrated in FIG. 3A, the channel map table 
» format includes for each field a column for name 300, type 302 (i.e., integer, string), size 304 
Q (number of bytes), flags 306, and description 308. The fields captured in the channel map table 
J'^ include user perceived channel indicator (UPCI) 3 10 and broadcast program ID (BpID) 312, 
0 which are both required 2 byte integers. The UPCI 310 is the channel number illustrated on the 
"20 front panel of the STB 220 and the BpDD 312 is a service provider's reference number for a 

particular network or presentation stream. As illustrated in FIG. 3B, the table simply converts 
the UPCI 310 that the viewer recognizes into the BpID 312 that the HE 210 recognizes. For 
example, a UPCI of 02 is a BpID of 27. 

FIGs. 4A and 4B illustrate an exemplary network table formal afd network table that 
25 would be stored in the program database 214. As illustrated in FIG. 4A, the network jabl^" 
format includes for each field a column for name 400, type 402 (i.e., integer, string), size 404 
(number of bytes), flags 406, and description 408. The fields captured in the network table 
include network ID (NET_ID) 410, network name (NET_NAME) 412, network call sign 
(NET_CALL) 414, and network reference (NET_REF) 416. Each of the fields has the 
3 0 characteristics defined in the table. For example, the NET_ID 410 is a 4-byte long integer that is 
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a unique and required field. FIG. 4B illustrates how for each NET_NAME 412 and NET_CALL 
414 there is a unique NET_E) 410 assigned and potentially a unique NET_REF 416. For 
example, WHYY FOX Philadelphia is assigned a NET_ID of 0001, but does not have a 
NET_REF. 

5 FIG. 5A illustrates an exemplary BpID table format for a BpID table that would be stored 

in the program database 214. As illustrated, the BpID table format includes a column for name 
500, type 502, size 504, flags 506, and description 508. The fields captured in the BpID table 
include a unique record ID (BID_ID) 510, the BpID (BID_BPID) 512, a broadcast domain 
number (BID_BDMNUM) 514, and the NET.ID (BID_NETID) 516. Both the BID_BDMNUM 
10 5 14 and the BID_NETID 5 1 6 are foreign keys that point to the network table (FIG. 4B) and a 
broadcast table (not illustrated) respectively. FIG. 5B illustrates an exemplary BpID table. 

■J^ FIG. 6A illustrates an exemplary program table format for a program table that would be 

stored in the program database 214. As illustrated, the program table format includes a column 
0 for name 600, type 602, size 604, flags 606, and description 608 columns. The fields captured in 
£S the program table include a unique program ID (PRG_ID) 610, program start time 
.r (PRG_START) 612, program end time (PRG_END) 614, the NET_ID (PRG_NETID) 616, and 
U program title (PRG_TITLE) 618. The program table contains the program data provided by, for 
example, TVData. As such, the program table should also include the program type and program 
p category (not illustrated) that TVData uses to classify programs. FIG. 6B illustrates an 
S~0 exemplary program table. 

As would be understood by those of ordinary skill in the art, the STB 220 will likely not 
have sufficient memory to store the entire program database 214 in the program database 260. 
As such, either the HE 210 will modify the program database 214 and transmit the modified 
database to the STB 220 or the HE 210 will transmit the entire program latabase 214 and the 
25 STB 220 will process the data and store a modified version thereof. In either event, exemplary 
headers and table formats for the STB 220 are discussed below. 

Within the STB 220 the channel map and the network tables can be combined into a 
single table that would be specific to a local service domain or zone. FIG. 7A illustrates an 
exemplary channel map header format. As illustrated, the channel map header format includes 
3 0 for each field a column for name 700, type 702 (i.e., integer, string), size 704 (number of bytes). 
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flags 706, and description 708. The fields captured in the channel map header include channel 
map version (version) 710, number of records in the channel map (count) 712 and size in bytes 
of the record (size) 714. 

FIG. 7B illustrates an exemplary channel map record format. As illustrated, the channel 
5 map record format also includes a column for name 700, type 702, size 704, flags 706, and 
description 708. The fields captured in the channel map record include UPCI 730, BpID 732, 
network call sign (netcall) 734, network reference number (netref) 736, and an index of the 
network in a program table (netidx) 738. The netidx 738 has a different meaning based on 
whether the program database 260 has a fixed or variable number of records (discussed below). 
10 As one skilled in the art would recognize, the channel map table size would be the number of 
presentation streams multiplied by the record size plus the header size. As illustrated the header 
is 6 bytes and the record is 24 bytes, so if there were 200 presentation streams the channel map 
table would be 4806 bytes. 

A program table within the STB 220 may be either a fixed or variable length table. For a 
M fixed length program table the time period covered will be divided into time slots of equal 
J ~ length, for example, 5 minutes, 10 minutes, 30 minutes. As would be obvious to one skilled in 

the art the granularity depends on the amount of storage in the STB 220. The fixed length 
y program table stores information about the programs appearing in those slots for each of the 
f =ij networks. It is possible that one network may have multiple programs during a single time slot. 
Wo In those cases, the program table captures the program that airs for the majority of the time slot. 
For example, FIG. 8 illustrates an exemplary program schedule displaying programs 800 airing 
for each of three networks 810 for the 8:00 to 8:30 time slot 820. While ESPN and CNN each 
only have one program during this time slot, TBS has two different programs airing in this time 
slot, since TBS programs are aired at 5 and 35 minutes past the hour. The program data table 

2 5 would assign Ripley' s Believe It or Not! to the 8:00 time slot since it covers 25 of the 30 minutes 

of that slot. 

The fixed record size and fixed slot size allow the VCPS 290 to very rapidly locate the 
program information for the network at the given time. The fixed length program table is also 
easily divided into one or two hour increments. This enables the program data for the current 

3 0 time to be transmitted to the STBs 220 quickly and often. In a fixed length table, each network 
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has only one entry per time slot. The program data is found by locating the start of the time slot 
corresponding to the event time and adding the netidx 738. 

FIG. 9A illustrates an exemplary fixed length program header format. As illustrated, the 
fixed length program header format includes for each field a column for name 900, type 902, size 
5 904, flags 906, and description 908. The fields captured in the fixed length program header 
include program table version number (version) 910, number of minutes per slot (slotsize) 912, 
number of slots (slotcount) 914, starting date/time of the program table (startdate) 916, and 
number of networks per slot (netcount). 

FIG. 9B illustrates an exemplary fixed length program record format. As illustrated, the 
1 0 fixed length program record format also includes a column for name 900, type 902, size 904, 
% flags 906, and description 908. The fields captured in the fixed length program record include 
'O network reference number (netref) 930, program type (type) 932 and the program ID (prgid) 934. 
f jl The netref 930 verifies that the channel map corresponds to the program table by requiring that 
Jt this field match the Uke-named netref field 736 in the channel map corresponding to the network 
111 reference number. The prgid 934 is a host-assigned unique program ID, and is used to calculate 
Li the holding factor for programs. Programs that are the same, even if aired at different times, will 

have the same prgid 934. The type 932 contains the type or category of the program 
H corresponding to the prgid 934. 

^= As one skilled in the art would recognize, the fixed length program table size would be 

2 0 approximately 96K for 200 channels having 30 minute time slots and 24 hours of data (200 

channels * 10 bytes/time slot/channel * 48 time slots). If the program table size was too large, 
the time slots could be increased, the amount of time captured for in the table could be 
decreased, the netref 930 could be removed so that the verification wasn't performed or other 
modifications that would be obvious to one of ordinary skill in the ari ' 

25 A variable length program table is particularly useful when the accuracy of ex^tlywhat 

program is being viewed is of critical importance. The variable length program table includes a 
variable length program header (Fig. lOA), a variable length network header (Fig. lOB), and one 
or more variable length data records (Fig. IOC). 

FIG. lOA illustrates an exemplary variable length program header format. As illustrated, 

3 0 the variable length program header format includes for each field a column for name 1000, type 
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1002, size 1004, flags 1006, and description 1008. The fields captured in the variable length 
program header include a program table version number (version) 1010, starting date and time of 
the program table (startdate) 1012, duration of the program table (duration) 1014, and number of 
networks (netcount) 1016. 

FIG. lOB illustrates an exemplary variable length network header format. As illustrated, 
the variable length network header format includes fields for network reference number 
(NETREF) 1020, and number of programs for the network (count) 1022. 

FIG. IOC illustrates an exemplary variable length data record format. As illustrated, the 
variable length data record format includes for each program a start time (start) 1030, which is 
expressed as difference from the startdate 1012, duration of program (duration) 1032 measured 
in minutes, program type (type) 1034, and a program ID (prgid) 1036. 

In a variable length table, all program data for a single network is specified prior to 
moving on to the next network. The netidx 738 provides the location for the start of the program 
data for that network. The program information is then sequentially searched to locate the 
program time that corresponds to the event. A variable length program table provides the VCPS 
290 the most exact data about the programs. Since the number of records is variable, the exact 
table size is unknown. As would be obvious to one of ordinary skill in the art, the data sizes of 
the program records can be reduced at the cost of flexibility and accuracy. 

FIG. 11 illustrates an exemplary logic flow of how user interactions are captured by the 
VCPS 290. The user interface 224 receives user interactions 1 100, such as, channel changes 
1 102 and power on/off 1 104, and forwards these interactions 1 100 to the event queue 280. The 
event queue 280 captures these viewer interactions 1 100 and time stamps them according to the 
time 1110 provided by the clock 270. The clock 270 is also used to initiate internal events 1 120, 
such as, change of day part (not illustrated), and end of show 1122, and these internal events 
1120 are also stored in the event queue 280. As illustrated, the event queue 280 includes four 
entries, each entry identifying the time 1110 and the type of event. For example, the first event 
was a channel change 1102 that occurred at 10:29:55. The entries from the event queue 280 and 
the time 1110 from the clock 270 are provided to the profile engine 230. In a preferred 
embodiment, the entries are taken in time order (i.e., first in, first out). 
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FIG. 12 illustrates an exemplary class diagram of how the click stream of a viewer is 
processed to generate a viewer profile. A ClickStreamManager 1210 manages a 
ClickEventListener 1220, a ClickStreamProcessor 1230, and a ProgramContentManager 1240. 
The ClickEventListener 1220 listens for events, both internal and via the user interface, and time 
5 stamps the event with the time provided by a clock 1250. As one of ordinary skill in the art 
would recognize, internal events, such as change of day part, change of day, and end of program, 
are not related to the click stream of a viewer but may still be captured by the ClickEventListener 
1220 and processed by the other components of the exemplary class diagram of Fig. 12. 

The ClickStreamProcessor 1230 retrieves events from the ChckEventListener 1220 and, 
10 if applicable, requests program data for a selected program from the ProgramContentManager 
1240. The ProgramContentManager 1240 retrieves the applicable program data from 

fill ap 

«J ProgramContent 1270. The program data includes, but is not limited to, program title, program 
yg genre, program category, and network. The ClickStreamProcessor 1230 then processes the 
■ :f events and, if applicable, program data utilizing ClickStreamFilters 1260 to filter events such 
if that only the appropriate event types are used in generating profile categories. 

3 The ChckStreamFilters 1260 direct and add data to the relevant profile category to 

f i generate or update a Profile 1290 comprising multiple profile categories. Such categories, as 

described further herein, include preferred networks, preferred programs, viewing duration, 
Q channel change frequency, and holding factor. The ClickStreamProcessor 1230 also generates a 
5 0 ClickEventRecord 1295, which is a record of events for a given session or other time period. 

In one embodiment, the ClickStreamProcessor 1230 will continually poll the 
ClickEventListener 1220 for the next event. If no events have been triggered, the 
ClickStreamProcessor 1230 will sleep for a short period of time. Otherwise, the 
ClickStreamProcessor 1230 will handle an event and then immediately check for the next event. 
2 5 Event processing is based on the event time. According to this embodiment, the 

ClickStreamProcessor 1230 handles events, such as channel change and power on/off events, 
from the remote control to dynamically update a viewer's profile. Thus, the 
ClickStreamProcessor 1230 spends most of its time polling the ClickEventListener 1220 for 
new events. When an event occurs, the ClickStreamProcessor 1230 passes the 
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ClickStxeamFilters 1260 (an array of profile filters), which extract specific data for the household 
or viewer profile. 

Referring back to Fig. 2A, the profile filters 250 are a set of components that implement 
an interface and provide mechanisms to profile the viewer based on viewer interaction with the 
television. Each event that occurs, whether viewer interaction or an internal event, is passed to 
each profile filter 250. Each profile filter 250 determines whether or not the event is relevant to 
the data the profile filter 250 is tracking. Each filter performs an aspect of profiling. For 
example, one filter may track average channel change frequency while another tracks preferred 
content. 

In a preferred embodiment, profiles generated by the VCPS 290 include multiple profile 
categories, each category reflecting distinct relevant characteristics about a viewer's viewing 
preferences and habits. Examples of profile categories include, but are not limited to: 

• preferred programs - measure of the amount of time a viewer(s) watch a particular 
program or program category; 

• preferred networks - measure of the amount of time viewer(s) watch a particular 
network; 

• viewing duration - amount of viewing time per viewing session; 

• channel change frequency - average number of channel changes per time period; 

• holding factor per program or program category - how much of an entire program 
or program category a household or individual watches; and 

• surf sequence - typical order networks are visited in a surf routine. 

As one of ordinary skill in the art would recognize, there are fflirrferous other profile 
categories that could be included that would fall within the scope of the current invention. JThe 
current invention is in no way not limited to the categories described herein. 

In order to generate a preferred programs profile, the VCPS 290 collects information 
(characterizations) about the programs that a viewer or household watches. The 
characterizations about the programs may include, but is not limited to, types, categories, genres, 
or some combination thereof. In a preferred embodiment, the characterizations will include 
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genre and category for each program. The genre is a consistent high-level classification of a 
program (i.e., a generic set of program types or categories), such as sports, comedy, drama, etc. 
A category is a sub-class of the genre classification that is a more specific classification than the 
genre. 

FIG. 13 illustrates an exemplary subset of genres 1310 and categories 1320 as defined by 
the VCPS 290. As illustrated, a comedy genre includes categories for movie, network series, 
syndicated, and TV movie. As one of ordinary skill in the art would recognize, the number and 
type of genres 1310, the number and type of categories 1320, and the relationship therebetween 
can be modified without departing from the scope of the current invention. 

The program genre and categories may be defined by the VCPS 290, received from a 
service provider (e.g., TVData), or derived from a service provider. In a preferred embodiment, 
program characterizations are obtained from, for example, TVData and these characterizations 
are converted into genre and category. TVData provides a program type and category for each 
category and these characterizations are translated (e.g., mapped) into the program classifications 
of genre and category. 

FIG. 14 illustrates an exemplary conversion of program type 1400 and program category 
1410 provided by TVData converted to program genre 1420 and program category 1430 in the 
VCPS 290. For example, a TVData program type "SY" (syndicated) and category "comedy" 
maps to a VCPS genre "comedy" and type "syndicated". The TVData category is not 
necessarily a subset of the type. Moreover a "*" indicates a "match all" meaning that the type or 
category matches all categories or types respectively. The VCPS 290 types are a subset of the 
genre. 

The VCPS 290 tracks the total number of seconds that a viewer watches particular 
program categories (genres). The VCPS 290 responds to all channel change and power on/off 
events. When a channel change occurs, the VCPS 290 records the time and network,'«nd%)cates 
the program's characteristics from the program table. When the next channel change occurs, the 
VCPS 290 notes the elapsed time and stores that elapsed time in an array. Because it is possible 
that a program will end prior to a channel change, the VCPS 290 saves the elapsed time to the 
appropriate program category (genre) and then gets the program information for the program that 
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is about to air such that when a channel change or power off event occurs the appropriate time 
spent on each program category will be accurate. 

The VCPS 290 may also track the preferred program categories (genres) of the household 
by day or day part. A day part is a range of time during the day that is used by advertisers to 
characterize viewers. The VCPS 290 can create an event for a day part by saving elapsed time to 
the specific day part and generating a new event for the next day part. 

FIG. 15 illustrates an exemplary process flow for processing genre data in order to 
generate a viewer profile. The process starts when a new event is detected and is passed to the 
genre filter at step 1501 . A check is made as to whether or not the event is a power on event or a 
new session event at step 1503. If yes at step 1503, the current session data up to that point is 
cleared at step 1505, and the current channel and event time are saved (i.e., stored in memory for 
the new session) at step 1507. If no at step 1503, then the actual program data associated with 
the channel tuned to at the event time is obtained at step 1509. The genre for the particular 
program is then determined, utilizing the program data, at step 1511. The total genre viewing 
time is calculated at step 1513. The current session profile is then updated to reflect the total 
genre viewing time for that session at step 1515. The current channel and event time are then 
saved at step 1507. The processing of the event is then complete (step 1517). 

FIG. 16 illustrates an exemplary preferred program category (genre) profile 1600, 
reflecting the top five program categories (genres) chosen by this viewer and the associated 
relative durations that those program categories were watched. The category profile 1600 
includes program type (genre) 1610, duration 1620 (total time spent viewing each program type 
1610 in HH:MM:SS format), percent 1630 (percentage of the total viewing time spent viewing 
that the particular program type) and a bar graph 1640 reflective of the percent 1630, or the 
duration 1620, or both for each program type 1610. As illustrated, tte number one program type 
(genre) 1610 is shopping, which this particular viewer has viewed over 30% of the tin^. ^ 

Note that this example profile is an aggregated profile comprising more than a weeks 
worth of data. A similar session profile comprising a much smaller span of viewing time may 
also be obtained by the VCPS 290. As one of ordinary skill in the art would recognize, there are 
numerous other formats of this graph that would be well within the scope of the current 



Patent Application 



-29- 



T734-10 



invention. For example, the bar graphs 1640 could be a pie chart or the chart could be further 
broken out by day or day part. 

FIG. 17 illustrates an exemplary graphical representation of the genre profile of a viewer 
based on viewing time of each genre. The genres 1700 (i.e., art, entertainment) are displayed 
5 along the horizontal axis and the viewing duration (time) 1720 is displayed along the vertical 
axis. In the exemplary illustration, the total viewing time 1730 is identified in the upper right 
hand comer. As illustrated, the entertainment genre is watched the most at approximately 10 
hours of the total of nearly 23 hours of programming that have been monitored. As one of 
ordinary skill in the art would recognize, the graph could have different styles, formats, or 
1 0 orientations, or could be a table or other representation, without departing from the scope of the 
W current invention. For example, the graph could be further broken out by day or day part. 

As previously discussed, program categories 1320 are more specific than genres 1310 
yj and therefore provide for increased granularity. The VCPS 290 also tracks program categories 
Cj 1320 in the same fashion as it tracks program genre 1310. An exemplary graphical 
16 representation of the program category profile is not illustrated. However, as one of ordinary 
hi skill in the art would recognize, the graph of the program category profile would be similar to the 
r", genre profile illustrated in FIG. 17 except that it would have more data points and each data point 
would likely account for less time. In one embodiment, all the program categories may be 
illustrated on one graph. In an alternative embodiment, a separate graph may be illustrated for 
2 0 each genre. According to one embodiment, the program category profile can be broken out by, 
for example, day part. 

A viewer type profile estimates what type of viewer (i.e., man, woman or child) is 
watching a particular program. In one embodiment, the VCPS 290 uses program classifications 
(genre/category) and day part information to derive the estimates. FK.lSA illustrates an 
2 5 exemplary table of probabilities of the viewer type based on the genre/category of prggranjs. ' 
The table includes a genre/category column 1810 and columns associated with the probability of 
a viewer of a program having that genre/category being a male 1812, a female 1814 or a child 
1816. As illustrated in the exemplary table, the probability of a man watching an action/movie is 
40%, while the probability is 30% for woman and children. 
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The VCPS 290 may adjust the probability data based on the day part. For example, 
because the probability that a daytime viewer is a man is lower than the probabihty that the 
viewer is a woman or a child, this fact results in an adjusted and reduced probability that the 
daytime viewer is male. FIG. 18B illustrates an exemplary day part adjustment table, according 
5 to one embodiment. The table includes a day part column 1820 and adjustment columns for men 
1822, women 1824, and children 1826. The adjustment factor is multiplied by the probability 
defined in the table of Fig. 18A to determine an adjusted probability. An adjustment value of 1.0 
indicates that no adjustment is required, while values smaller than 1.0 will adjust the probability 
downwards, and values larger than 1.0 will adjust the probability upwards. For example, the 
10 adjustment factor for weekdays between 09:00 - 16:00 is .3, .9 and 1.0, for men, women and 
children respectively. 

As one of ordinary skill in the art would recognize, applying the adjustment factor will 
'^1^ likely mean that the sum of the probabilities for a particular day part will not equal 1 .0. Hg. 18C 
03 illustrates an exemplary table for normalizing the probabilities. The table includes columns for 
f $ viewer type 1830 (man, woman, child), for adjusted probability 1840, adjusted sum 1850, and 
■0 normalized probability 1860. Using a viewer watching an action movie (respective probabilities 
|=% of .4, .3 and .3 from Fig ISA), during daytime hours (respective adjustments of .3, .9, 1 from Fig. 

1 8B) the viewer has an adjusted probability of . 1 2, .27 and .3 of being a man, women or child 
1=^ respectively (see column 1840). As illustrated, the adjusted sum 1850 is .69 and the normalized 
ij) probabilities 1860 are .174, .391 and .435 respectively. 

The distinction of a viewer type by gender, as described above, can be readily extended 
to other viewer types and demographics, and in general any set of heuristic rules or probabilities 
can be appUed to the viewer interactivity data to generate predicted viewer traits and 
demographics, as will be evident to those skilled in the art. The pres|ntiinvention is not meant to 
25 be limited to distinguishing three viewer types as described, but could include additional sets of 
rules and probabilities which, in conjunction with viewer interactivity data, can be used to derive 
or infer viewer demographics and other attributes. 

FIG. 19 illustrates an exemplary table of conditional probabilities of the likely viewer 
demographic makeup for different program categories (genres). The table includes different 
3 0 demographic groups 1910 (i.e., age, income) for different program categories 1920 (i.e., news, 
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fiction). Each demographic group 1910 has numerous categories 1930 defined thereunder (i.e., 
ages 0-10, 10-18). Each cell in the table contains a conditional probability that represents the 
likelihood that the viewer or viewing group watching a particular category (genre) of program 
1920 is within a particular category 1930 for each demographic group 1910. For example, as 
5 illustrated there is a 10% chance that a viewer watching the news will be between the ages of 0- 
10. As will be evident to those of ordinary skill in the art, other sets of rules and probabilities 
could also be utilized without departing from the scope of the present invention. 

In order to generate a preferred network profile, the VCPS 290 tracks the networks that 
are most watched by each viewer or household. FIG. 20 illustrates an exemplary preferred 
1 0 networks profile 2000, reflecting the top five networks 2010 chosen by this viewer, the 

associated percentage 2020 and relative duration 2030 those networks were watched. Horizontal 
bars 2040 also graphically illustrate the relative duration of network viewing. The VCPS 290 
^fl responds to all channel change and power on/off events. When a channel change occurs, the 
03 VCPS 290 records the time and network, and when the next channel change occurs, the VCPS 
If 290 notes the elapsed time and stores that elapsed time in an array. The VCPS 290 may also 
^3 track the preferred networks by day or day part. Note that this exemplary profile is an 

aggregated profile comprising more than a weeks worth of data. A similar session profile 
f comprising a much smaller span of viewing time may also be generated by the VCPS 290. 

FIG. 21 A illustrates an exemplary network viewership table broken out by day part. The 
Wo day parts may be associated with a single day, weekdays, weekends, or other intervals that may 
be appropriate. As illustrated, the breakout is for weekdays. The network viewership table 
includes a network column 21 10 and various day part columns 2120. As illustrated, there is a 
total of eight-day parts having varying time durations associated therewith. The viewership 
duration for each day part for each network is recorded in the appropriate cell of the table. As 

2 5 illustrated, the time is in seconds, but could be in minutes, hours or other suitable time . . 

parameters. As one of ordinary skill in the art would recognize, the number and duration of the 
day parts could vary without departing from the scope of the current invention. 

FIG. 21B illustrates an exemplary graphical representation of viewership by network day 
part. The graph displays the total time 2130 (illustrated in hours) on the vertical axis and the 

3 0 channels (networks) 2140 illustrated on the horizontal axis. 
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A viewing duration profile is the average duration of each viewing session. This is useful 
for determining how much television a viewer or household watches at a time (e.g., per session). 
This information can help identify the households that do not watch television often. To collect 
statistics for this profile, the VCPS 290 responds to power on/off events. The VCPS 290 tracks 
the elapsed time between a power on event and a power off event. The VCPS 290 also generates 
an average duration of each session (i.e., average viewing duration for all viewing sessions). 

FIG. 22A illustrates an exemplary average viewing statistics profile 2200. The profile 
2200 includes average statistics for viewing duration 2210, click frequency 2220, and the 
holding factor 2230. The viewing duration 2210 depicts the total amount of time (i.e., in 
minutes) of this particular viewing session. The click frequency 2220 depicts the frequency of 
channel changes during the particular viewing session. The holding factor 2230 is how much of 
an entire program a household watches and the average holding factor 2230 is the average of all 
the holding factors. No matter how many channel changes occur, the VCPS 290 will track the 
total time that each program is viewed. 

The VCPS 290 also tracks viewing duration by day or day part. FIG. 22B illustrates an 
exemplary viewing duration profile by day part 2250. The profile 2250 includes the total 
duration 2255 (i.e., in hours) that the profile 2250 is based on, a period column 2260 and day of 
week columns 2270. The period column 2260 indicates the time of day, for example, as 
illustrated, the periods are each 4-hour periods of time (e.g., 8pm to midnight). The days of 
week columns 2270, illustrate each day of the week (e.g., Monday, Tuesday, etc.). The profile 
2250 tracks the viewing duration (i.e., in hours) for each period of time for each day of the week. 
As illustrated, the greatest viewing duration was on Friday between the hours of 8pm and 
midnight (e.g., 17 hrs). 

A channel change frequency profile measures how often or rapidly a viewer or household 
changes channels. This information can be used to determine characteristics about thahoy.sehdld 
and it can help differentiate individual users within the household. FIG. 23 illustrates an 
exemplary channel change frequency by day part profile 2300. The channel change frequency is 
expressed as the average number of channel changes per time period (i.e., 30 minutes). As 
illustrated, the profile 2300 includes the total number of channel changes (cUcks) 2310, a period 
column 2320, and day of week columns 2330. The number of clicks 2310 is recorded for a 
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certain time period (e.g., one week). The period column 2320 includes time periods in which the 
channel changes are calculated (i.e., 4pm to 8pm). The day of week columns 2330 include a 
colunm for each day of the week (i.e., Monday, Tuesday, etc.). The profile 2300 tracks channel 
changes and calculates channel change frequency for a given day, during a given period of time. 
5 As illustrated, for Tuesday between 4pm and 8pm the channel change frequency calculated for 
that day part is illustrated as "0". To collect statistics for this profile 2300, the VCPS 290 
responds to all channel change events, and collects the number of channel changes. The profile 
2300 is then generated by calculating the average channel change rate for a predetermined period 
of time (e.g., 30 minutes). 

10 A holding factor profile generally indicates relative interest levels in certain programs by 

^ :^ a viewer or household. A holding factor is how much of an entire program a viewer or 
iJj household watches. The VCPS 290 tracks the total time that each program is viewed. FIG. 24 
y illustrates an exemplary holding factor profile 2400 for an example household for specific 

programs. For each program title 2405, a holding factor 2420 along with the viewing duration 
If 2415 (i.e., in seconds) and dwell time 2410 (i.e., in seconds) for a sample household is 

illustrated. The average holding factor takes into account the total time a particular program is 
C- watched for the entire duration of the program. For example, an hour-long program airs on 
r i channel A. Assuming a household watches channel A for 20 minutes, then changes to another 
f channel for 10 minutes, then returns to channel A for 20 minutes, and finally turns the STB 220 
2 0 off. The holding factor 1220 for the program on channel A is 40/60 = 66.7%. Note that in this 
example the holding factor is the total time spent viewing an individual program. For holding 
factors that are tracked by individual programs, a program reference number identifies each 
program. The VCPS 290 stores the program reference number in the program table. The VCPS 
290 responds to power on/off events, channel change events, end of program events, and end of 

2 5 day part events. It is to be understood that the holding factor profile 2400 could also be applied 

to networks, program categories, genres, etc, and is not limited to specific program. ^ 

A channel search sequence is the order that the viewer typically visits specific networks, 
and this may be tracked by day part. The VCPS 290 tracks the order the networks are typically 
visited by averaging the visit position of each network over a series of channel surf sequences. 

3 0 This is quite useful for profiling and viewer identification as there may be certain channels that a 

particular viewer typically watches, or a specific surf sequence a particular viewer may perform. 
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FIG. 25A illustrates an exemplary table with data related to surf sequence channel order 
being captured and processed. The table includes a column for networks 2500, channel surf 
sequences 2510, 2520, 2530 and a typical search order 2540. The networks column 2500 lists 
networks that are desirable to track. As illustrated five networks are tracked (CNN, ESPN, 
5 fflST, TBS, TLC). The channel surf sequence columns 25 10, 2520, 2530 capture the order that 
each network was selected during a surf sequence. For example, in the first surf sequence 2510, 
ESPN was the first network that was selected. The typical search order 2540 is a weighted 
average of all surf sequences 2510, 2520, 2530. Notice that in third surf sequence 2530, the 
viewer did not visit the HIST channel. In this case, the typical order for the history channel 
10 relies only on the two channel surf sequences 2510, 2520 that it has data for. 

Alternatively, a viewer may always sequentially search channels, such a channel range of 
=0 the sequential search can help identify the viewer or an associated signature. As discussed 
^ further below, the VCPS 290 compares the session-based search order with the search order for 
all signatures. 

f 5 FIG. 25B illustrates an exemplary table having data related to number of times each 

Q network is selected during a viewing session being captured. The table includes a network 
r° column 2500 and a surf session column 2550. The networks column 2550 Usts networks 
O desirable to track (CNN, ESPN, LIFE, TBS, TNT, USA are illustrated). The surf session column 
2550 lists how many times a particular network was selected in a viewing session. As illustrated, 
2 0 ESPN was selected in 3 surf sequences and USA was not selected. Many networks typically 
carry certain genres of programs, such as sports, drama, comedy, talk illustrates, etc. for a 
specific day part. Because viewers have varying preferences for certain types of programs, 
particular viewers will look for specific programs on certain networks based on the time of day 
and day of week. Different viewers also have certain differing dislil^s and each-viewer may 

2 5 avoid surfing disliked networks entirely. The VCPS 290 monitors the networks visited during 

channel surfing for the current day part and compares the data to signatures to see if there is 
similar data. 

The VCPS 290 keeps track of two arrays, one for the number of times the viewer visits a 
network (count 2560) while surfing and the other to determine the total dwell time 2570 on that 

3 0 network while surfing. This data can also be tracked by day part. The VCPS 290 also calculates 
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the average channel dwell time, which is calculated by dividing the data in the total time array by 
the data in the total count array. The result indicates the amount of time that a viewer watched 
each particular network during a channel surf sequence. This result can be used to determine 
whether the program on that network is of interest or potentially of interest (i.e., the viewer spent 
5 time determining if they were interested) or not (i.e., the viewer was simply surfing past the 
channel). Viewers will typically have a higher average dwell time on networks that carry 
programs of interest to the viewer. Fig. 25C illustrates an exemplary table capturing dwell time 
2570 and surf count 2560. 

FIG. 25D illustrates a graph of the average channel dwell time for all networks in a given 
10 session. The networks are represented by the channel number on the horizontal axis 2580. The 
dwell time is represented along the vertical axis 2590 (i.e., in seconds). In addition to networks 
;Q having different average dwell times based on viewer interest levels, the overall average dwell 
% time can vary and be characteristic of particular viewers (indicate demographic or psychographic 
C3 traits). For instance, shorter dwell times may indicate these viewers may be able to discern 
=|5 content faster than others and thus be associated with higher intelligence and relative income. A 
'^^^^ Fast Fourier Transform (FFT) may also be used to analyze the sequence of channel changes to 
r extract frequency information. 

iJ A method will now be described for profiling viewers on a session-by-session basis and 

□ for differentiating viewers in a household of multiple viewers. When the identity (or profile) of a 
'"S 0 specific viewer or viewers that are watching during any given session can be identified and 

distinguished from other potential viewers (or profiles), targeted and custom content, including 
ads, can be delivered to those viewers. Such targeting will be much more efficient and accurate 
than if the targeting was based solely on household demographics or on a household profile. In a 
preferred embodiment, the viewer profiling and identification is accompHshed by monitoring and 

2 5 processing channel change data, but it could also be accomphshed by processing other forms of 

viewer interactivity including volume adjustments, EPG activity, etc. Moreover, the algorithms 
and methods described herein are exemplary methods and implementations and are by no means 
the only possible mechanisms or implementations of the present invention. 

The VCPS 290 uses the concept of a viewing session to identify and profile individual 

3 0 viewers and households. In a preferred embodiment, a viewing session is a period of time during 
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which the viewers do not change (i.e., those viewers watching at the start of the session are the 
same viewers watching at the end of the session). The VCPS 290 identifies the different viewing 
sessions and the individuals (and their associated profiles) involved in those sessions. Although 
viewing sessions can be determined and delimited by a viewer's self -identification (e.g., via a 
5 viewer specific remote, viewer login or custom menu usage), in the preferred embodiment, 
automatic detection of viewing sessions and associated viewer or viewers is employed. An 
exemplary embodiment of automatic detection is now described, but it is to be understood, as 
will be evident to those skilled in the art that the invention is not limited to the exemplary 
embodiment described, but could be implemented in a variety of ways. 

1 0 FIG. 26 illustrates an exemplary process flow chart illustrating how sessions are 

automatically initiated, terminated and new sessions detected, according to one embodiment. 
iQ The power is turned on at step 2601 resulting in the start of a new session at step 2603. Any 
"'f events for that session are processed at step 2605. A determination is made if there is a change 
53 in viewing pattern at step 2607 (discussed in more detail later). If there was a change at step 
; J5 2607, the session is ended at step 2613. If there was no change at step 2607, a determination is 

made as to whether there has been recent viewer activity at step 2609. If there was no activity at 
h«= step 2609, the session ends at step 2613. If there was activity at step 2609, a determination is 
i"l made as to whether the power was turned off at step 261 1 . If the power was not turned off, the 
j*r next event is processed at step 2605. If the power is turned of fat step 26 11, the current session 
hko is ended at step 2613. If the power is turned off at the end of the session at step 2615, the session 

detection process ends at step 2617 (i.e., until the next power on event). Otherwise, a new 

session is started at step 2603. 

FIG. 27A illustrates viewing sessions being determined based on when the STB 220 is 
powered on or off. A first session 2710 starts when the STB 220 is turned on 2712 and 

2 5 terminates when it is turned off 27 14. A second session 2720 begins the next time the STB 220 

is powered on 2722 and ends at the next power off event 2724. 

FIG. 27B illustrates sessions being defined based on fixed length windows of time. As 
illustrated, a first session 2750 starts when the STB 220 is powered on 2752. The first session 
2750 extends for a fixed amount of time, such as the length of the average program or until the 

3 0 end of the current day part (illustrated as a first day part 2760). A second session 2770, is then 
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initiated at the start of a second day part 2780. The second session 2770 continues until it is 
terminated at the next power off event 2772. Alternatively, the session could have continued 
until the end of the second day part 2780. This method improves on the simple power on/off 
session definition described above as it handles STBs 220 that are left on for long periods of 
5 time. Also, because viewers change frequently at the transition point between day parts, this 
method handles the transition of one viewer to another. 

FIGs. 28A-C illustrate how viewing sessions can be defined based on viewer activity or 
inactivity, and how session start and end points can be based on remote control activity or 
inactivity. In Fig. 28A, the initiation of remote activity 2800, such as channel surfing, marks the 
10 start of a first session 2810, and occasional remote activities 2820 keep the session alive. 
f:;3 However, if the remote control is idle for an extended period of time (period of inactivity) 2830, 

the VCPS 290 considers the next activity from the remote control 2840 to be indicative of the 
!|3 start of a second session 2850. The period of inactivity 2830 used to determine a new session 
ifi can vary widely depending on the application and may be dynamically configurable. 

iji 5 FIG. 28B illustrates a case where the VCPS 290 immediately terminates the first session 

r , 2810, after a period of inactivity of a defined time 2860. The second session 2850 does not 

C3 begin until activity 2840 resumes. Viewers are not considered to be watching television during 

n the period of inactivity after the session ends, and therefore there is no session 2870. The period 

;«=' of inactivity 2860 can be specified as a fixed amount of time or a varying amount of time based 

20 on day, day part, network, etc. 

FIG. 28C illustrates an example of how the first session 2810 is ended based on a period 
of inactivity 2880, but only after a program currently being viewed has ended 2890. This 
method has the advantage that even though extended periods of inactivity occur, a viewer may be 
simply watching a particular program for an extended period of time and thus the session should 

2 5 cover these periods. As illustrated, this embodiment shortened the period associated with no 

session 2870 since the first session 2810 did not end until some time after (period of inactivity 
2880) the end of the current program 2890. As one skilled in the art would recognize there are 
numerous methods for utilizing the period of interactivity 2880 in conjunction with the end of 
program 2890 that are well within the scope of the current invention. For example, the period of 

3 0 inactivity 2880 may be defined as a time, such as 60 minutes or the end of a program 2890, 
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whichever is greater. Alternatively, the period of inactivity 2880 may be defined as 60 minutes 
or 30 minutes after the end of the program 2890, whichever is greater. Alternatively, the period 
of inactivity 2880 may be a total of 60 minutes or 30 minutes of inactivity after the end of a 
program 2890, whichever is less (as long as the program was over prior to 60 minutes of 
5 inactivity). 

Viewing sessions can also be defined based on the content accessed by the viewer and the 
interactivity behavior of the viewer. Examples of such interactivity behavior include but are not 
limited to, the rate of channel change, program type preferences, preferred networks, and 
program search patterns. The interactivity behaviors could vary by day part. Using the concept 
10 of a sliding window, viewer behavior changes, such as increased channel change frequency, and 
viewer preference changes, such as changes in programs or networks, can be detected and used 
ill to terminate a current session, initiate a new session, or both. A determination as to whether to 
,S start a new session based on the change of preferred networks can be made based on 
y3 predetermined rules regarding the networks (e.g., different networks attract viewers with 
Hi 5 differing demographic attributes), or could be based on previously stored session information 
and profiles and/or signature profiles (discussed further herein). 

j= T FIG. 29 illustrates the concept of a sliding window identifying session termination and 

initiation points. A sliding window 2900 is a window of specific duration 2910, which is 
Q continually translated forward in time along a time axis 2920. As the sliding window 2900 
'"'2 0 moves along the time axis 2920, data, such as preferred networks, within the sliding window 
2900 are monitored and compared with the same data for an overall session up to that point in 
time. Utilizing this comparison, the VCPS 290 can determine whether or not the viewership has 
changed based on how well the data (e.g., preferred networks) in the sliding window 2900 
compares with similar data for the entire session. When the viewing behavior within the sliding 

2 5 window 2900 is sufficiendy different from the overall session behavior, the VCPS 290 considers 

the data within the sliding window 2900 as part of a new session 

As illustrated on a first time axis 2920A, the sliding window 2900 includes networks 
DSC, ESPN, and FX. These networks are considered to be identifiable with a single viewer and 
thus are identified as session 1. As time progresses to time axis 2920B, the sliding window 2900 

3 0 includes networks FX, ESPN and Lifetime. These changes in networks are not yet enough to 
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trigger a change in session. However, as time progresses to time axis 2920C and the sliding 
window 2900 includes mostly networks Lifetime and A&E, the VCPS 290 initiates session 2. 
The determination as to whether to start a new session based on the change of preferred networks 
can be made based on predetermined rules regarding the networks (e.g., different networks 
5 attract viewers with differing demographic attributes), or could be based on previously stored 
session information and profiles and/or signature profiles (discussed further herein). 

This method of automatic session detection and transition is extremely powerful as it is 
driven by viewer habit and preferences, and occurs in real-time. Although in this example the 
preferred networks are monitored by the sliding window it is to be understood, as will be evident 

10 to those skilled in the art, that other classes of viewer interactivity and preferences or 

combinations of classes (weighted or non- weighted) could also be used with the sliding window 

% method to identify session transition. These other classes may include, but not limited to, 

y3 preferred program genre, networks, surf patterns, program categories, viewer type (i.e., male, 

fg female, child), volume adjustments, and EPG activity. 

SM5 In a preferred embodiment, the VCPS 290 uses two sliding windows, a small window to 

°~ detect radical changes in viewer behavior and a larger window to detect more subtle changes, 
^"f FIG. 30 illustrates implementing sliding windows by collecting data into fixed-width data bins 
hi termed sub-sessions. Each sub-session represents a unit of time during which viewer 

interactivity is monitored. A window consists of a fixed number of the sub-sessions, starting 
i-i 0 with the most recent. As the amount of time represented by a single sub-session elapses, the 
window slides so that it always incorporates the most recent set of sub-sessions. According to 
one embodiment, the VCPS 290 uses six 5-minute sub-sessions 3002-3012 with two sliding 
windows, one of 15 minutes 3020 (3 sub-sessions) and one of 30 minutes 3030 (6 sub-sessions). 

The fifteen-minute window 3020 has a high threshold of viewer behavioral change as it is 
2 5 meant to detect radical changes in the viewing behavior in a short period of time. The thirty- 
minute window 3030 detects more subtle changes in behavior and has a lower threshold for 
change detection. Each sub-session is labeled with a range of minutes, which indicates the 
amount of time that has elapsed since session began. Two additional sub-sessions, represent data 
that is outside the windows 3020, 3030 (events that occurred 30+ minutes ago). One sub-session 
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stores events that occurred over 30 minutes ago for the current day part 3040. The other sub- 
session stores data that occurred over 30 minutes ago for this session 3050. 

As events occur, the data is stored in the 0-4 minute sub-session 3002. After 5 minutes 
has elapsed, the data for all sub-sessions except the 30+ minute sub-sessions 3040, 3050 is 
5 shifted to an adjacent bin (as illustrated, shifted to the right). Data coming out of the 25-29 
minute bin 3012 is added to both the 30+ minute day-part bin 3040 and the 30+ minute session 
bin 3050. The 0-4 minute sub-session 3002 is cleared. The 15-minute sliding window 3020 
encompasses the first three sub-sessions 3002-3006 while the 30-minute sliding window 3030 
encompasses the first six sub-sessions 3002-3012. 

10 In addition to collecting session data in order to generate a session profile, which is 

limited to a viewing session and which is collected over a relatively short period of time, the 
=13 VCPS 290 generates and updates a more complete profile of the user by combining current 

session data with historical data for this viewer. This aggregated viewer or household historical 
data is referred to as a signature. A signature can best be described as the aggregation of session 
ill 5 data for a particular type of viewer or household over an extended (multiple sessions) period of 
r time. A single signature does not necessarily correspond to a single unique viewer in a 
Q household. The combination of multiple members of a household watching television 
simultaneously may result in a household signature, different than any of the individual 
signatures. 

2 0 Also, members of the household with similar viewing habits may be aggregated into a 

single signature. It is also possible that different viewing habits for a single member of the 
household can result in multiple signatures for that member. The VCPS 290 stores multiple 
signatures, in order to account for multiple viewers and subsets of those viewers within a 
household, each signature representing at least a subset of different viewing characteristics. In a 

2 5 preferred embodiment, the VCPS 290 stores up to fifteen unique signatures per household, and 

each signature may correspond to a particular viewer, a group of particular viewers, or to the 
entire household of viewers, and the VCPS 290 retains the signature information in long-term 
storage (e.g., on a hard drive or in non-volatile memory). 

In a preferred embodiment, the VCPS 290 generates and stores viewer and household 

3 0 signature profiles based on one or more session profiles. When the VCPS 290 is first initiated, 
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and there are no session or signature profiles, the VCPS 290 generates the first signature profiles 
from the first session profiles. Then, as the VCPS 290 monitors and processes data from 
subsequent viewing sessions, it will continually attempt to correlate the session data with a 
specific signature, and associate the long term viewing habits and other interactivity of a 
5 particular viewer or household with short term viewing habits and interactivity. In an alternative 
embodiment, signatures may be loaded into the STB 220 externally (either when the unit is 
manufactured, purchaser, or installed or during use). The loaded signatures may represent 
characteristics that advertisers are looking for, be standard signatures that identify distinct market 
segments, be representative signatures for a particular area, or other representations that would 

1 0 still be within the scope of the current invention. The preloaded signatures may be updated to 
take into account session profiles that are deemed to correlate with the signature profiles or may 

O stay the same until updated by the network operator or other responsible party. 

^0 In a preferred embodiment, the signature profile comprises the same profile categories as 

fQ the session profile allowing for a direct correlation between categories. The VCPS 290 writes 

js session data to the signature history at the end of day parts and sessions. Each 5-minute sub- 

§3 session tracks the day part that it resides in, and the 30-minute day-part sub-session 3040 does 

Isi the same. When data is being added from the 25-29 minute sub-session 3012 into the 30+ 

}'i rninute day-part sub-session 3040, the VCPS 290 verifies that the sub-sessions are the same day 

H part 3060. If the sub-sessions are for a different day part, the data from the 30+ minute day-part 

i'2 0 bin 3040 is written to the signature data that best matches the current session and the 30+ minute 
day-part bin 3040 is cleared. 

Because the VCPS 290 tracks all of the session data and also must write day part data to 
the signatures, the 30+ minute session sub-session 3050 stores all data for the current session that 
is outside the sliding windows. This data is used for comparing changes in session behavior. 

2 5 When a session terminates due to behavior change, data within the sliding window represents a 

new session while data outside that window is the old session. 

If the 30 minute window 3030 detected the change in session, the 30+ minute day-part 
sub-session 3040 is written to the signature file and the 30+ minute day part bin 3040 and 30+ 
minute session bin 3050 are cleared. Note, the 30+ minute session bin 3050 is not required to be 

3 0 written to the signature file as the 30+ minute day part bin 3040 is written to the signature file. 
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and all previous day parts that were part of this session were previously written to the signature 
file. 

If the 15-minute sliding window 3020 detected the behavior change, then data in the 5- 
minute sub-sessions outside the 15-minute sliding window 3020 (last 3 five-minute sub-sessions 
5 3008-3012) are also written to the signature file and cleared. When a session terminates due to 
power off or lack of user interaction, all data in the 5-minute sub-sessions 3002-3012 and the 
30+ minute day-part sub-session 3040 are written to the signature file and cleared. 

FIGs. 31A and 3 IB illustrate an exemplary process flow of how the sliding windows are 
managed and events are associated with sessions according to one embodiment. A new event 
10 occurs at step 3101. A determination is made as to whether it is the end of a bin (i.e., 5 minutes 

of data saved in current bin) at step 3103. The new event will be with the current sub-session 
15 (bin) until the end of the sub-session (bin). If the current sub-session is still active, a 
S determination is made as to whether the power is turned off at step 3 105. If the determination of 
step 3105 is yes, the data collected for the current sub-session will be saved to a signature file at 
step 3113 and the session ends at step 3115. Otherwise, the event will be processed using the 
filters at step 3 107 and the profile characteristics derived will be added to the current sub-session 
1,1, at step 3 109. The processing of the event is complete at step 3111. 

yj If the new event occurs at the end of a sub-session at step 3 103, then a check is made as 

js- to whether the leading edge of the sliding window coincides with the end of a day part at step 
I'S 0 31 17. If they do not coincide, then the last sliding window sub-session is shifted into the 30+ 
minute session bin at step 3119. If they do coincide, then the day part profile is saved to a 
signature file at step 3127, the data is cleared from the 30+ minute day part bin at step 3129, and 
then the last sliding window sub-session is shifted into 30+ minute session bin at step 3119. 

At step 3121, a determination is made as to whether there was any change in viewing 
25 pattern. If there was no change, all sliding window sub-sessions are shifted at step 3123 and the 
current bin is cleared at step 3 125. The data from the ending sub-session is then processed via 
the filters at step 3107 and added to the profile for that sub-session at step 3109. If a change in 
viewing pattern is detected, then the session data is saved to a signature file at step 3131 and a 
new session is started at step 3133. All sHding window bins are then shifted at step 3133. 
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As one of ordinary skill in the art would recognize, there are numerous methods for 
performing the determination of step 3121 that are well within the scope of the current invention. 
For example, the determination can be made by comparing the current viewing characteristics 
with the session profile, or, if applicable, with the signature profile associated with the session 
5 profile. The current viewing characteristics may be defined as a single element of time or may 
be multiple elements of time for purposes of comparison. The embodiment described above 
used two periods of time (last 15 minutes and last 30 minutes) for the current viewing 
characteristics. It should be noted that the current invention is not limited to these time frames. 
According to one embodiment, these time frames would be an adaptable parameter of the 
1 0 system. 

If multiple time frames are used the thresholds for determining a change in viewership 
aj would be different. As described above, the shorter window of time would be used to detect 
iij radical changes while the longer window of time would be used to detect more subtle changes. 

The comparisons may be based on one or more viewing characteristics. If multiple viewing 
''15 characteristics are used in the comparison, the different characteristics may have different 
weighting factors applied. If multiple windows of time are used, the same elements may be 
compared, the same element with different weighting factors may be compared, or different 
y j element may be compared. 

Q FIGs. 32A and 32B illustrate how session data is compared with the signature data 

' 2 0 already stored in the VCPS 290. As data is collected about a session, the VCPS 290 continually 
calculates (in real or near-real time) the session profile and compares that to existing signature 
profiles. Fig. 32A illustrates a session profile 3200 and correlation factors 3210 for existing 
signature profiles 3220. As illustrated, the session profile 3200 has the best correlation with 
signature 4 (77%). This iterative process continues until the session terminates. According to 

2 5 one embodiment, the VCPS 290 compares the current session data against the signature data 

every five minutes to find the signature that correlates best with the session. The "best" 
signature match may change several times while session data is being collected. This is to be 
expected since the accuracy of the data increases as the amount of viewing data increases. It 
should be noted that although the session profile is illustrated as tracking networks, it is to be 

3 0 understood that the actual correlation of a session to a signature may entail correlating various 

category profiles, combinations of category profiles, or weighted combinations of category 
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profiles. According to one embodiment, the correlation factors are adaptable. It would be within 
the scope of the current invention for the user, the network operator or other parties to adapt 
these parameters. 

When the viewing session terminates, the current session profile 3200 is added to the 
5 signature profile 3220 that had the best correlation 3210. The VCPS 290 performs a weighted 
average between the session profile 3200 and the signature profile 3220 so that the data is 
properly scaled based on overall viewing time. Thus, the signature profiles 3220 that represent 
long periods of viewing history will only be slightly altered by new viewing data. In addition to 
adding the session profile 3200 to the associated (highest correlation above threshold) signature 
10 profile 3220 at the end of the session, the VCPS 290 may also add the current session profile 

3200 to the associated signature profile 3220 at the end of day parts, the end of programs, end of 
'^fi days or defined time periods. 

i|3 FIG. 32B illustrates a case where the session profile 3200 does not correlate with any of 

the existing signature profiles 3220 (none of the correlation meet a minimum correlation 
' :M5 threshold). Therefore, the session profile 3200 is used to create a new signature profile 3220 that 
„ " is stored in one of the empty signature bins 3230. 

fi According to one embodiment, once a matching signature has been established, a fully 

f f populated 15-minute sliding window 3020 (all three 5 minute bins 3002-3006 are filled) is used 
i3 to determine viewer behavior changes. The VCPS 290 compares the data in the 15-minute 
2 0 sliding window 3020 against the signature data. The data within the 15-minute sliding window 
3020 must differ significantly (e.g., exceed a difference threshold or not meet a correlation 
threshold) from the signature data for the VCPS 290 to terminate the session. As would be 
obvious to one of ordinary skill in the art, reducing the difference threshold (or increasing the 
correlation threshold) would likely result in earlier termination of sessions, and increasing the 

2 5 difference threshold (or decreasing the correlation threshold) would likely result in longer 

sessions. 

In another embodiment, once the matching signature profile has been identified the 
VCPS 290 may also use a fully populated 30-minute sliding window 3030 to determine viewer 
changes. As the 30-minute sliding window 3030 is designed to detect more subtle changes, the 

3 0 difference threshold is accordingly lower (or correlation threshold is accordingly higher) than for 
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the 15-minute sliding window 3020. If the 30-minute sliding window 3030 and the signature 
exceed the difference threshold (or fall below the correlation threshold) the session is terminated. 
In an alternative embodiment, the 30-minute sliding window 3030 is next compared to the 
overall session before a termination decision is made. If this difference threshold is also 
5 exceeded (or falls below the correlation threshold) the session is terminated. The difference (or 
correlation) thresholds for both the 30-minute sliding window-vs.-signature and 30-minute 
sliding window-vs.-overall session may be the same or may be different (to account for the hkely 
differences in the session profile and the signature profile). 

FIG. 33 illustrates an exemplary graphical user interface (GUI) comparing session data 

1 0 and signature data. The GUI includes a session portion 3300, a best match portion 3340 and a 

C3 signature summary portion 3380. The session summary portion 3300 includes graphs for genre 

% 33 10, surf dwell time 33 15, network by day part 3320, category 3325, and viewer type 3330 for 

2; the session, and also includes total time 3335 associated with the session. The best match 

Ji portion 3340 includes graphs for genre 3350, surf dwell time 3355, network by day part 3360, 

Is category 3365, and viewer type 3370 for the signature, and also includes total time 3375 

s associated with the signature. According to one embodiment, the session data for genre, surf, 

q network and category is over laid on the signature data for the same. In a preferred embodiment, 

: the GUI would use different colors, one color for session data, another color for the signature 

0 data, and a third color illustrating the overlap between the session data and the signature data. 

2 0 The signature summary portion 3380, illustrates a number of signatures identified by 

their unique ID 3385, total time 3390 and correlation score 3395. The number of signatures 
displayed may be the top signatures (i.e., top 5), may be all of the signatures stored therein, or 
may be a predetermined number of signature slots. As illustrated, there are 15 signature slots 
displayed with the last three being empty signature slots. While it would be possible to store a 

2 5 large amount of signatures on the STB, there is a practical Kmit to how many should be stored 

thereon. The limit would be based on the memory in the STB as well as the granularity that is 
desired in the different signatures. As previously discussed, session monitoring and signature 
matching preferably occur in real time (or near-real-time), as session data is being received, 
processed, and continuously compared with signature data. Thus, the correlation scores are 

3 0 updated in real time (or near real time). 



Patent Application 



-46- 



T734-10 



The correlation scores 3395 may be based on the conrelation of a single element, multiple 
elements, or multiple weighted elements. According to one embodiment, the correlation score is 
based on the program genre, surf dwell time and networks. These elements may be weighted 
with program genre having the heaviest weighting factor. While the viewer type is not used in 
5 the correlation score in this embodiment, it may be used to exclude potential matches between a 
session and a signature. That is, if the session viewer type is significantly different from the 
signature viewer type, the match will be rejected. If the match is right on the threshold for 
acceptance, or no match can be found, the VCPS 290 examines the category data to assist in the 
session to signature match determination. For instance, if the session has a large amount of 
1 0 viewing time in a small set of categories, then the VCPS 290 can search the signature data for 
high viewing time in those categories. 

FIG. 34 illustrates an exemplary process flow of how, at the end of a session, session data 
;J3 is saved and matched to a signature. A session ends at step 3401, and then a determination is 
fS made as to whether or not the length (i.e., duration) of the session was at least a certain amount 
^5 of time (i.e., 15 minutes) at step 3403. If not, then the session data is discarded at step 3419 as 
;|3 such a short session is not considered to be of much value. If the session duration is greater than 
5^=^ 15 minutes, then a determination is made as to whether or not the session data correlates (i.e., 
;=J meets a correlation threshold) with a signature at step 3405. If the session data does correlate 
i>fe with a signature then the session data is added to the matching signature at step 3417. If the 
fi 0 session data does not correlate with a signature, then a determination is made as to whether the 
signature list is full at step 3407. If the signature list is not full (i.e., the number of stored 
signature has not exceeded a preset maximum) then the session data is saved as a new signature 
at step 3415. If however the signature list is full, then the session and existing signatures are 
"scored" based on viewing time and access frequency at step 3409 (e.g., a session or signature 
2 5 with larger viewing time and/or click frequency contains more data with which to profile and 
thus receives a higher score). A determination is made as to whether the session score is the 
lowest at step 341 1. If the session "scores" the lowest, then the session data is discarded at step 
3419. Otherwise, the signature with the lowest score is discarded at step 3413 and the session 
data is saved as a new signature at step 3415. 

30 It is to be noted that the session duration parameter (step 3403) used to determine 

whether or not the session data should be discarded is not required to be 15 minutes, but could a 
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much smaller or much larger unit of time, or could be an adaptable parameter without departing 
from the scope of the current invention. Also, the minimum correlation threshold used to 
determine whether a session and a signature correlate (step 3405), could be set to a relatively 
high or low value, or could be based on one or more parameters as discussed above without 
5 departing from the scope of the current invention. These parameters could be adaptable and set 
depending on implementation preference. Moreover, as one skilled in the art would recognize 
there are numerous methods for scoring the signatures (step 3409) that would be well within the 
scope of the current invention. 

According to one embodiment, the VCPS 290 may consider the signatures with the most 
1 0 viewing time to be off limits to purging. If a signature has a large amount of viewing time but 

the viewer that created it is no longer with the household, the signature will eventually be able to 
5i be deleted after the system accumulates viewing time for the active signatures. For the 
*f signatures that can be deleted, a combination of total viewing time and date that the signature 
IB: was last updated contribute to the signature that the system selects for purging. A STB box has 
j5 N slots for signatures and only signatures in the top N/2 slots can be considered off limits for 
purging. The system then calculates the total viewing time T for all signatures. If a signature is 
in a top slot and it has a viewing time of at least T/2N, then the signature cannot be deleted. 

yj For example, assume that the STB has 15 signatures (N=15) stored therein and has 

IZ compiled 50,000 seconds of total viewing time for all signatures (T=50,000). A signature cannot 
H 0 be purged if it is within the top 7 signatures (15/2) of viewing time and the signature viewing 
time is at least 1,667 seconds (50000/30). 

The signatures that can be deleted will be assigned a score based on the viewing time and 
date of last update. Given a viewing time of V and number of days since last update of D, then 
the score will be V/((D/10)+1). Thus basically states that every 10 days of signature inactivity 

2 5 reduces the value of the viewing time. For example, if the viewing time is 2000 seconds and it 

has been 12 days since the last update, the score will be 1000 [2000/((12/10)+l)]. Note that the 
calculation uses integer math, which rounds the result down. Thus, the denominator includes a 
+1 to ensure that the value will not be 0, 

The following example demonstrates all of these concepts. This example has six 

3 0 signature bins and a total viewing time of 60,000 seconds. Thus, the top three signatures that 



Patent Application 



-48- 



T734-10 



have a minimum signature time of 5000 seconds (60000/12) may be marked as off limits. 
However, as illustrated only two of the bins meet the viewing time and are marked off limits. 
Out of the remaining four signatures, signature 5 has the lowest score of 2000 
[4000/((16/10)+l)]. 



Slot Time (s) Davs Since Update Score Description 

1 30000 20 n/a off limits 

2 15000 0 n/a off limits 

3 4500 11 2250 

4 4000 8 4000 

5 4000 16 2000 deleted 

6 2500 2 2500 



Figs. 35A-B illustrate an exemplary method for updating the signature profile with the 
session profile based on activity (events captured in the event queue). After the process starts 
(step 3501), the event queue is initialized (step 3503), and the VCPS 290 waits for the next event 
(step 3505). Once the next event is received, the VCPS 290 determines if the event is a power 
on event (step 3507). If the event was a power on event, the VCPS 290 determines when the 
sliding window will end (i.e., 15 minute window and 30 minute window) and adds these times as 
upcoming events in the event queue (step 3509). The power on event is then processed (step 
3517). If the event was determined to not be a power on event, a determination of whether the 
event was a channel change event is made (step 35 1 1). If the determination is that the event was 
a channel change, the VCPS 290 deletes the end of program event associated with the previous 
channel from the event queue (step 3513) and processes the channel change (step 3517). If the 
event was not a channel change, the VCPS 290 determines if the event was an end of program 
event (step 3515). The end of program determination is made by comparing the current time to 
the time marked in the event queue for the end of the current program. If the event was an end of 
program, then the end of program event is processed by the VCPS (step 3517). 

After the event (power on, channel change, end of program) is processed at step 3517, the 
profile characteristics associated with the event are added to the current session profile (step 
3519). In accordance with a preferred embodiment of the current invention, the characteristics of 
the event are added on a time-weighted basis. Based on the current channel that is tuned to, the 
VCPS 290 determines when the current program is scheduled to end (based on program data) 
and adds an event for the end of the program to the event queue (step 3521). 
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If the event was determined not to be an end of program event, the VCPS determines if 
the event was an end of bin event as established in step 3509 (step 3523). If the event was an 
end of bin event, the end of bin event is processed (step 3525) and the characteristics of this 
event are added to the current session profile (step 3527). Next a determination is made as to 
5 whether it is the end of a day part (step 353 1). If it is determined that it was the end of a day 
part, the current session profile for that day part (i.e., bin 3040) is saved to the associated 
signature profile (step 3531) and then cleared (step 3533). Regardless of whether, the event was 
an end of day part or not, the data within the sliding windows is shifted to the next bin (step 
3535). Referring back to Fig. 30, the contents of bin 3002 is shifted to bin 3004, 3004 to 3006, 

10 ... and 3012 into 3040 and 3050. Now that the shding window has been shifted, the VCPS 

O determines the end of the sliding window and adds that data to the event queue (step 3537). The 
Jl process then returns to step 3505 (waiting for next event). 

11 If the event was not an end of pin event, the VCPS 290 determines if the event was a 
?1 power off event (step 3539). If the event was not a power off event, the process retums to step 
^5 3505. If the event was a power off event, the power off event is processed (step 3541), the 

characteristics of this event are added to the current session profile (step 3543), the session 
H profile is added to the associated signature profile, preferably on a time weighted basis (step 
3545), and the event queue is cleared (step 3547). The process is complete at step 3549. 

= As one of ordinary skill in the art would recognize, this process is merely an exemplary 

2 0 process. The steps of the process could be rearranged, steps could be removed or added, or 

completely different steps could be used to accomplish the same or similar result without 

departing from the scope of the current invention. 

According to one embodiment, the VCPS 290 also periodically compares each signature 

with every other signature, in order to identify signatures that may correlate with one another. 

2 5 For example, as session data is added to existing signatures, it is possible that some of the 

signatures will begin to correlate with one another, indicating that the signatures actually 
represent the same viewer or viewer type. In one embodiment, the VCPS 290 handles this case 
by attempting to correlate signatures after every signature update. These updates preferably 
occur at the end of day parts and the end of sessions. The process of merging or joining two 

3 0 correlated signatures is accomplished by comparing the recently updated signature with every 
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other signature stored by the VCPS 290. The signature that best matches (according to a 
predefined threshold) will be merged with the recently updated signature by calculating the 
weighted average of the signature data. Additionally, the VCPS 290 preferably performs the 
process iteratively, attempting to merge the new signature with all other signatures. This 
5 iterative procedure is carried out because combining the two signatures may have altered them 
enough to correlate with another signature. 

According to one embodiment, the VCPS 290 compares most of the elements of the 
signature data in order to determine if signatures correlate with one another. Unlike the session 
to signature comparisons, category data is used in the signature correlation process because the 

1 0 signatures represent many sessions, and the correlation of category data is particularly useful 
when comparing large data sets. However, viewer type is not used to correlate signatures, it is 

=13 only used to exclude matches. That is, if the VCPS 290 determines that two signatures correlate, 
;S the VCPS 290 will look at viewer type. As long as the viewer type is not significantly different, 
m the VCPS 290 will merge the two correlated signatures. However, if the viewer type is 

11 significantly different (i.e., signature 1= .8 male, .1 female, .1 child and signature 2=-. I male, .8 
female, . 1 child) then the VCPS 290 will not merge the two signatures. 

f =5 The VCPS 290 can perform the comparisons either after updating a signature with 

session data or as a background task. Performing the comparisons after an update will require 
g only comparing the updated signature with the other signatures. Background comparisons must 
20 compare all signatures against all others. If two or more signatures correlate beyond a certain 
correlation threshold, the VCPS 290 combines signatures by performing a weighted average of 
the data from each signature. Such a correlation threshold may be dynamically configurable 
(e.g., from instructions from the HE). 

In an alternate embodiment, standardized or customized signa|ures are downloaded from 
25 the HE 210 and stored at the STB 220 for utilization by the VCPS 290. Such custom signatures 
may be derived based on, for instance, expected or predicted viewer behavior, or alternatively, 
these signatures may represent aggregate session data from multiple households. An example of 
such a signature is one that reflects one or more demographic traits shared by a cluster or group 
of households, which may be identified by analyzing session profile data (or signatures) from an 
3 0 array of households, and grouping households together that have highly correlative session or 
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signature profiles. Session profiles can then be correlated with these "standard" signatures (e.g., 
a form of collaborative filtering) as well as with locally generated signatures as described above. 

As discussed above, the VCPS 290 correlates profiles with other profiles, including, but 
not limited to: 

5 • session profile v. itself - to determine when one session is terminated and another 

is began; 

• session profile v. numerous signature profiles - to determine which, if any, 
signature profile the session profile should be added to; 

• a session profile v. matching signature profile - to determine when a session is 
1 0 terminated and another is began; and 

• updated signature profile v. other signature profiles - to determine if updated 
signature profile now correlates with the other signature profiles. 



ill 



There are numerous ways that profiles can be correlated with each other. The correlation 
III can range from simple to complex. The complexity of the correlation depends on such factors as 
?1 what is being correlated, and how much data is contained within the profiles. For example, a 

correlation of a session profile against itself may be quite simple. That is, a single profile (i.e., 
iss network profile) may be monitored to determine when a session ends. On the other hand, a 
H correlation of a session profile to a signature profile may be more complex and may correlate 
1^) numerous profiles (i.e., genre, networks, category, surf) to determine when there is a match. 
2 Some possible profile correlations and the value of each are discussed below. 

Genre profiles indicate the amount of time that a viewer watches each type of program 
genre, in effect weighting the importance of that program type to the viewer. The sessions and 
signatures track the viewing times of each genre in an array. The correlation compares the 

2 5 elements of each of the genre arrays. As it is believed that viewers generally watch the same 

types of programs, using the genre in the correlation calculation provides a highly accurate 
method for quickly comparing two profiles. 

Channel surf dwell time profiles capture how long a particular channel or network will 
maintain the viewers interest during a surf routine. Because a channel surf sequence may only 

3 0 take one or two minutes, a major benefit of tracking channel surf behavior is that the dwell time 

data is collected over a short period of time when compared with viewing interests. The sessions 
and signatures track dwell times by network, and each one has an array of both the number of 
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times a network is visited during the surf sequence and the total dwell time on each network, in 
order to calculate an average dwell time per network per channel surf sequence. The average 
dwell time per network is correlated. 

Network profiles track the amount of time that the viewer watches a particular network 
by day part and day of week. Because a viewer will typically watch the same programs at the 
same time every week, correlating the data by the appropriate day and day of week results in 
matching a viewer session to the appropriate signature. In one embodiment, the session-to- 
signature correlation compares the network information for the current day and day part against 
the signature data for the same time. The signature-to-signature correlation used to merge 
signatures compares all network day and day part data. The average correlation is used to help 
determine whether the signatures match. Since the signatures aggregate data over many 
sessions, the network viewing by day part should be similar if the signatures actually represent 
the same viewer. 

Program category profiles are much more specific than genres and can be utilized, for 
instance, if a session cannot be matched with a signature or only produces marginal matches, to 
match a session with a signature. For example, if the VCPS 290 collects data reflecting a large 
amount of viewing time in a small set of categories, the VCPS 290 can search the history for 
signatures with large viewing times in those same categories. The search results can be used to 
determine if a marginal match between session and signature is actually a match. It can also help 
classify a session that would not otherwise correlate with a signature. 

As would be obvious to one of ordinary skill in the art, the amount of data gathered for 
each session and each signature is dependent upon the amount of memory in the STB 220, the 
amount of granularity (and potentially accuracy) desired. A tradeoff needs to be made between 
the amount of memory required and the granularity/accuracy desired* At^cordingto one . 
embodiment, each session profile includes the following category profiles: ^ » " 

• genre for overall session; 

• network viewership for current day part; 

• channel surf sequence for overall session; 

• category data for overall session; and 

• viewer type for overall session. 
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According to one embodiment, each signature profile includes the following category 
profiles: 

• genre for overall history; 

• network viewership for prime time for each weekday (Mon - Fri); 

• network viewership for evenings for all weekdays; 

• network viewership for late night for all weekdays; 

• network viewership for daytime for all weekdays; 

• network viewership for weekends; 

• channel surf sequence for overall history; 

• category data for overall history; and 

• viewer type for overall history. 

The VCPS 290 may track prime time viewership (8PM - 1 1PM) for the weekdays, 
evening viewing (6PM- 8PM) for all weekdays, late night viewing (11PM - SAM) for all 
weekdays, daytime viewing (SAM - 6PM) for all weekdays and weekend viewership. 
According to this embodiment, when comparing session data to signature data, the VCPS 290 
compares the following data as the primary match criteria: session genre data and signature genre 
data; session network data by day part with signature network data by same corresponding day 
part; and session surf dwell time with signature surf dwell time. Each element or category may 
be weighted based on the importance of the data in the correlation. Genre has the largest 
weighting factor in this embodiment, but other relative weightings are also possible. The 
correlation result is a score for the signature match. The signature that has the highest 
correlation with the session is considered the best match, and the sessioq-data may be associated 
with that signature. 

The signature profiles may be used to target content (advertisements, pay per view (PPV) 
events, video on demand (VOD) programming) to the subscribers or to customize their viewing 
environment (i.e., favorite programs listed first in EPG, format of EPG). The signature profiles 
stored in the STB 220 may be transmitted to the HE at determined intervals (i.e., every night at 
off-peak hours, every week). The HE may aggregate the data received from each of the STBs 
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220 connected thereto in order to form groups of viewers with similar characteristics. The 
groups having similar characteristics may receive specific content targeted to their characteristics 
(signature profiles). According to another embodiment, the HE may also use other external data 
to help group subscribers. The external data may be in the form of geodemographic data, such as 
the demographic data associated with distinct ZIP+4 geographies provided by Claritas (see 
Docket #s T726-00, T719-00 and T741-10 which were previously referred to and incorporated 
by reference for additional details). The external data may also be in the form of other 
transactional profiles, such as purchase transactions (see Docket #s T702-03, T706-00 and T741- 
10 which were previously referred to and incorporated by reference for additional details). 

The groups may be formed based on the topography of the television viewing network 
(i.e., based on nodes within the system). The groups would then consist of clusters of 
subscribers, wherein each of the clusters could be associated with a node, branch or other 
element of the system (see Docket #s T737-00 and T741-10 which were previously referred to 
and incorporated by reference for additional details). In an altemative embodiment, the STB 
may receive the additional data (i.e., geodemographic data, other transaction data) directly and 
incorporate this data into the generated profiles (i.e., signature profiles) to develop enhanced 
profiles. These enhanced profiles can then be forwarded to the HE for aggregation and potential 
grouping as described above. 

Regardless of where the enhanced profiles are generated, there is likely a practical limit 
to how many different groups are formed (i.e., 5). The groups may be formed and the 
advertisements targeted thereto, or ad profiles defining characteristics about the target market of 
the ad are created and the users are grouped to these profiles. As discussed above, each of the 
groups may receive material targeted for them. In one embodiment, everybody will receive all 
content and will select the appropriate content for display based on matching the jroup number 
associated with the content and the group number of the STB, or particular user or group of users 
interacting with the STB. In a preferred embodiment, each STB will receive only thecontent 
associated with it based on what group it falls in, thus saving bandwidth. If the content is 
targeted ads, the targeted ads may be inserted in place of the default ads to create a plurality of 
presentation streams (programs with targeted ads). Local cable companies have the equipment 
necessary to insert targeted ads as they are permitted to substitute local ads in place 
approximately 20% of the default ads. In the preferred embodiment, it is necessary for the 
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television system to be able to route the different presentation streams to different areas (i.e., 
nodes, branches) within the system so that each user only receives the appropriate presentation 
stream (see Docket #s T708-01, T708-01PCT, T708-02, T712-10, T712-10PCT, T721-20 which 
were previously referred to and incorporated by reference for additional details). 

In an alternative embodiment, the STB will insert specific content (i.e., targeted ads) 
locally. Thus, each subscriber may, in effect, have ads targeted directly to them. The ads may 
be inserted in avails (advertisement opportunities) regardless of the programming being viewed. 
The STB may receive the targeted ads via an ad channel. The ads may be delivered ahead of 
time and stored on the STB, or be received at approximately the same time as the avail and 
inserted on the fly. If the ads are to be stored on the STB, the ad channel may deliver a plurality 
of ads on the ad channel with the STB selecting the appropriate ads to store thereon. The 
selection may be made based on group designations or may be made by comparing the ad profile 
with the signature profiles on the STB. In addition to the ads, an ad queue will need to be 
generated and stored on the STB to define what order the ads are inserted and what criteria affect 
the criteria (see Docket #s T721-10, T721-10PCT, T721-12, T721-17, T721-20 and T721-22 
which were previously referred to and incorporated by reference for additional details). 

In an alternative embodiment, the ad queue may be stored on the STB while the ads are 
stored somewhere else within the network. When the ad queue determines the next ad to be 
inserted, the ad is retrieved from the network and delivered to the STB. The ad may be delivered 
over an ad channel or over a dedicated line (such as an Internet connection). If the ad channel 
was used the ads may be delivered in real time (would require coordination between program(s) 
and ad channel), or ahead of time and temporarily stored (i.e., start delivering one avail in 
advance). The ads could be delivered on the ad channel either at the actual bit rate for the ad or 
at a slower bit rate where portions of the ad are stored as they are rec^v^d. As one skilled in the 
art would recognize, the Internet connection provides both benefits and drawbacks. The . ^ _ 
drawback is the need the separate connection. The benefit is that the queue points to the URL 
address for the ad so the ad is displayed upon selection from the queue (see Docket # T721-16 
which was previously referred to and incorporated by reference for additional detail). 

In a preferred embodiment, the ad queue is actually a plurality of ad queues with a 
different queue associated with each signature, or group of signatures. When the VCPS 290 
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determines that a session has ended and thus a new user or group of users (identified by a 
correlation with a new signature) is interacting with the TV, the ad queue changes to the one 
associated with that signature. Moreover, the ad queue may be adjusted based on the programs 
being watched as certain advertisers may not want ads displayed during certain programs or may 
pay a premium for insertion in other programs (See Docket # T721-19 which was previously 
referred to and incorporated by reference for additional detail). 

According to another embodiment, the STB may also store ads and ad queues for other 
types of targeted advertisements such as EPGs, program bugs (overlay in the comer of the 
video), product placement (illustration of product placed in program, such as a Coke can placed 
in actors hand), trick play ads (shortened version of ad displayed when user fast forwards 
through recorded ad), and record ads (altemative ads placed in a recorded program if it is 
determined that different subscriber is viewing or if it is being viewed for more than the first 
time). The ad queue could be used to manage how all of these ads are displayed so as not to 
saturate the viewer but yet enhance the advertisement effectiveness. The various types of ads 
and possible coordination therebetween are discussed in Docket #'s T727-00, T727-10, T727- 
lOPCT, T728-10, T738-00, and T738-01 which have all previously been referred to and 
incorporated by reference for additional detail. 

Regardless of what is being correlated (ad profile to signature profile, session profile to 
signature profile), how many different elements are being correlated (program genre; program 
genre, channel changes and networks), and whether the different elements are weighted or not, 
there are numerous ways to perform these correlations. If the profiles are in the form of vectors 
you can perform a scalar dot product on the vectors to determine the correlation between the two 
vectors. In order for a scalar dot product to work the vectors must be normalized. That is, the 
magnitude of the vector must be 1.0. To calculate the magnitude, talp^ the square-root of the sum 
of the squares of the components as shown below. - - 



To normalize the vector, divide each component by the magnitude of the vector. Sample 
dot products are shown below. 
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One problem with the simple scalar dot product is that, it is possible to have vectors that 
O do not match each other have the same scalar dot product as vectors that are identical (first 
S illustrated vector above). 

m According to another embodiment, the variance (o^) and standard deviation (a) of vectors 

may be used to calculate the correlation. Assuming the two vectors that we wish to correlate are 
y3 labeled A and B, the variance is defined by the following function: 

□ The equation produces a low variance if there are only small differences between the 

f 0 components of each vector. This indicates a high correlation. A variance of zero indicates that 
there is no difference between the two vectors. Since each distribution vector is normalized, the 
square of the difference of each component will be in the range of 0 through 1. Since the 
variance is the average of this value, the variance will be limited to the same range. The actual 
variances will be small since the differences between components of ft distribution will be small, 
15 and the differences are squared. ^ « 

The standard deviation, a, is the square root of the variance. Since the variance will be 
very small, the square root will increase the value. Therefore, it is suggested that the standard 
deviation be used as the basis of the correlation value. As a correlation of 1.0 would reflect a 
perfect match a standard deviation of 0.0 is a perfect match. Since the standard deviation will be 
2 0 constrained to values between 0 and 1, the following equation is suggested for the correlation: 
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correlation = l-o 

Using the above examples, the correlations are now calculated as: 
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The above examples of correlation methods are exemplary in nature and in no way are 
intended to limit the scope of the invention. As would be obvious to one or ordinary skill in the 
art, there are numerous other methods for determining the correlation between profiles that 
would be well within the scope of the current invention. 

As will be evident to those of ordinary skill in the art, the VCPS 290 appUcation and 
other software components, can be implemented in a variety of software languages, including C, 
C+-1-, and Java. Moreover, the VCPS can be implemented on and/or integrated with a variety of 
STB platforms, including those boxes produced by Motorola (formerly General Instrument) and 
Scientific Atlanta, and with a variety of operating systems, including VxWorks and PowerTV, 
and with a variety of middlewares such as those produced by Liberate and OpenTV. 

Although this invention has been illustrated by reference to specific embodiments, it will 
be apparent to those skilled in the art that various changes and modifications may be made which 
clearly fall within the scope of the invention. The invention is intended to be protected broadly 
within the spirit and scope of the appended claims. 
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